Gene Tery_4844 details

Gene Information

Locus tag: Tery_4844
Symbol:
ID: 4246498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7437613
End bp: 7439352
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 37%
IMG OID: 638109680
Product: hypothetical protein
Protein accession: YP_724256
Protein GI: 113478195
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
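
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 7437613
END_BP = 7439352
GENE_LENGTH_BP = 1740
PROTEIN_LENGTH_AA = 579


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions, 7439352 - 7437613 + 1 gives 1740 bp, and 1740 / 3 - 1 gives 579 residues, which agrees with the listed gene and protein lengths.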


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.415691
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTTG AAAGACCATT AGGCTCAGTA ATTCAAGGTT CCCTCAGTCA AGGATTAGAA 
GTACGACTCC ACCCGGACGT ATTGGTAGAA GATATGCGGG TTGGTAAATT TTTAGTTGTA
CAAGGAGTAC GCGCCCATTT TTTCTGTATG CTAACCGATG TTTTATTAGG AACATCTAGC
GAACGAATAA TGATTAATCC ACCCCTACCC ACAGATGATT TTTTGCAATC TGTTTTAGCT
GGAGGTAGTA CCTACGGAAC TATCGAACTC GCTCCAATGT TAATGTTAGC TATTGCTCCA
GAACAATTAC CAGACTCTTT CAATTTCAAC AACACAAATG ATAATCAAAA AAAATTAAAG
TCAACCCAAA ATTTAGCATC CTTTGAAGCT CAAAGCAGTT CTCAAATTAA ATTAATGCCA
GTCAAAACTA TTCCTAGTCA TTTTTCCCAA GTATTTGAAG CAAGTGTCAG AGATTTTAGT
TTAGTTTTTG GCAGGGAAGA CGACCCGACT CGCCGGAATT TTGCTGTGGG TAAACCTATT
GATATGGATG TACCTGTTTG TTTAGATTTA GATAGATTTG TAGAACGAAG TAATGGGATT
TTTGGTAAGT CGGGAACTGG TAAATCTTTT CTGACTCGTT TACTTTTATC TGGCATTATT
CGGAAAGGTG CTGCCGTAAA TTTAATTTTT GATATGCACT CCGAATATGG TTGGGAAGCA
ATTGCAGAAG GAAAACAAGT TAATACTGTA AAAGGTTTAC GACAATTATT TCCCGATCGA
GTCGAACTTT GGACGCTTGA CCCAGAATCT ACTAGACGTA GAGGGGTGCA TGATGCACGA
GACCTGTATT TAAGTTATAA CCAAATTGAG GTTGAAGATA TTGGGTTAGT GCAACGGGAG
TTAAATTTAT CTGAGGCAAG TATTGATAGT GCAAATATTC TCCGCAGTGA ATTTGGTAAA
TCTTGGATTA CTAAATTATT GGCAATGACT AATGAAGATA TTCAAATATT TTGTGATGAA
AAAAGAGGTC ATAAAGGTTC GATTATGTCG TTGCAAAGAA AGTTGTTACG ACTGGATAAT
CTGAAATATA TGCAGACAAA AAATACCAAC AATTATATAG AAGAAATCTT AGAATCTTTA
GATGCAGGTA AGCACGTTAT TATTGAATTT GGTTCCCAGT CAAATATGCT TTCCTATATG
TTGGCAGCTA ATATGATTAC TCGCCGAATT CATAATAGTT ATGTACGGAA AGCAGATAAA
TTTTTGAGTT CTAAAAATCC GAGCGATCGC CCTCAACCAT TAGTTATAAC TATTGAGGAA
GCTCATCGTT TTCTTGATCC TGCGATAGTA CGTTCAACTA TTTTTGGTAC GATAGCTAGG
GAGATGCGGA AATATTTTGT GACTCTATTG GTTGTAGACC AGCGACCTTC GGGAATAGAT
GCGGAAGTTA TGTCTCAAAT TGGTACGAGA ATTACAGCTT TGTTGAATGA TGATAAGGAT
ATTGATTCTA TTTTTACAGG AGTTTCTGGA GGTCATAGTT TGAGGTCTGT TTTGGCAAAG
TTGGATTCTA AACAGCAAGC TTTGGTATTA GGTCATGCGG TACCAATGCC TGTGGTAATT
CAAACTCGTG CTTATGATCA GACTTTTTAT CAGGAAATTG GAGATACGGA TTGGCGGTAT
GTTTCTGATA ATGAGTTATT TGAAGCGGCT CAAGTGGCTT TGGATGATAT TGGGTTTTAG
 
Protein sequence
MDFERPLGSV IQGSLSQGLE VRLHPDVLVE DMRVGKFLVV QGVRAHFFCM LTDVLLGTSS 
ERIMINPPLP TDDFLQSVLA GGSTYGTIEL APMLMLAIAP EQLPDSFNFN NTNDNQKKLK
STQNLASFEA QSSSQIKLMP VKTIPSHFSQ VFEASVRDFS LVFGREDDPT RRNFAVGKPI
DMDVPVCLDL DRFVERSNGI FGKSGTGKSF LTRLLLSGII RKGAAVNLIF DMHSEYGWEA
IAEGKQVNTV KGLRQLFPDR VELWTLDPES TRRRGVHDAR DLYLSYNQIE VEDIGLVQRE
LNLSEASIDS ANILRSEFGK SWITKLLAMT NEDIQIFCDE KRGHKGSIMS LQRKLLRLDN
LKYMQTKNTN NYIEEILESL DAGKHVIIEF GSQSNMLSYM LAANMITRRI HNSYVRKADK
FLSSKNPSDR PQPLVITIEE AHRFLDPAIV RSTIFGTIAR EMRKYFVTLL VVDQRPSGID
AEVMSQIGTR ITALLNDDKD IDSIFTGVSG GHSLRSVLAK LDSKQQALVL GHAVPMPVVI
QTRAYDQTFY QEIGDTDWRY VSDNELFEAA QVALDDIGF
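
The sequences above are printed in space-separated blocks of 10 characters. The following sketch shows one way to strip that grouping, compute a GC fraction, and translate the gene sequence for comparison with the listed protein. It is a minimal illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons, and it does not handle alternative start codons (this gene starts with ATG, so that case does not arise here). All names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    first_blocks = "ATGGATTTTG AAAGACCATT AGGCTCAGTA"
    print(translate(first_blocks))
```

Applied to the first three blocks of the gene sequence, `translate` returns MDFERPLGSV, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare with the GC content field in the Gene Information section.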