Gene Tery_5007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_5007 
Symbol 
ID4246662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp7648976 
End bp7650172 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content31% 
IMG OID638109817 
Producthypothetical protein 
Protein accessionYP_724393 
Protein GI113478332 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAGACT ATAAATCCAC TTTTTTGATT ATTACATGGT TTACTTTCTT TTGCTTTTCT 
CCCACTGCGA GTAGAGCGAC AATTGATGAT TCCAGCATAG GTAAATTTAG TAATCGTACC
TTGAAAGAAA CAGAAAACGT ACAAGAATCT CATCAAGTTA ATTTTCTAGT TCCTCTATAT
ATTGCTCAAG ATGGTGACTC AAAAGCTAAG GAATATCAAA ATAAAAAATC AAATTCTTCT
GATTCAAAAG TACTATTACT AATTAGTTCT ATCCTCAAAG CTTTAGGAAA TCAAAAAACA
CTATTTCTTT TGATAAGTAT AATATCTGTT CTAGTCACTA GCATAGCGGT TTTCCTTTTA
CTCAAGTTGT TTGACGATAA TGAGCCTAAA GATTTAGATC CAGAGGAAAT ATCAGAAGTA
AATTCATCTG ATTTAAAAAA TAACTATCCT CAACTCAATA GCCAAAAGTC CTCAGTATCT
TCCGAAAACT ATTCCGTGGC TTATACTGAA AATTCTTTAG ATTATTTTGT TCCTGAAGCA
CAAAAAGACT TAGGGGAAAA TACTTATGTA CCAATTCCCC AAAATAGCTC CAAATCACAT
TATCAATATC AAGAAATTGA AGCAGAAACA TTTCCTATAG TTGAGAGTAA TTCTTCTATA
GATCGCAAAG AAGATAATAA CTATGAGCAA AAAGAAACTC AACTAAATTC ATCAAAAACA
ATTGATAAAA ATTATATCGT TCAAGATTCT AACTTTGACA ATATTACTCA CAAAAATAAT
TATTCTTGGC CTGAAGTTAA TATTGTTGAG CAGTTGATTC ACGAATTGCA AAATTTTGAT
CCTAATAAAC GACATCAAGC TATTTGGAAG CTTGGTCAAA AAGGAGATTC TAGAGCTGTT
CAACCATTAG TTAATTTACT CATAGATTCT GATTCTAAAC AGCAAAGTTT AATTTTAGCA
ACTCTCTCAG AAATTGGTAC TAGAACATTA AGACCAATGA ATAGAGCTTT AGCAATATCA
CTACAGAATG ATAATGCTGA AGTAAGAAAG AATGCTATCC GAGATTTAAC ACGAATTTAT
GAGTTAATAA TTCAAAGCAC CAACTTATTA CAACAAGCAG AGTATGACTC AGATCCAGAA
GTTGAAGAAA CAGCAAAATG GGCTTTAAAA AAGTTAAATA GAACCAATAT AAGATGA
 
Protein sequence
MIDYKSTFLI ITWFTFFCFS PTASRATIDD SSIGKFSNRT LKETENVQES HQVNFLVPLY 
IAQDGDSKAK EYQNKKSNSS DSKVLLLISS ILKALGNQKT LFLLISIISV LVTSIAVFLL
LKLFDDNEPK DLDPEEISEV NSSDLKNNYP QLNSQKSSVS SENYSVAYTE NSLDYFVPEA
QKDLGENTYV PIPQNSSKSH YQYQEIEAET FPIVESNSSI DRKEDNNYEQ KETQLNSSKT
IDKNYIVQDS NFDNITHKNN YSWPEVNIVE QLIHELQNFD PNKRHQAIWK LGQKGDSRAV
QPLVNLLIDS DSKQQSLILA TLSEIGTRTL RPMNRALAIS LQNDNAEVRK NAIRDLTRIY
ELIIQSTNLL QQAEYDSDPE VEETAKWALK KLNRTNIR