Gene Tery_4421 details


Gene Information

Locus tag: Tery_4421
Symbol:
ID: 4246074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6815319
End bp: 6816425
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 40%
IMG OID: 638109305
Product: TRAP dicarboxylate transporter- DctP subunit
Protein accession: YP_723882
Protein GI: 113477821
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component
TIGRFAM ID:
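
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 6815319, 6816425
gene_length_bp, protein_length_aa = 1107, 368

assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the reported values agree: the span covers 1107 bp, which is 369 codons, or 368 residues plus one stop codon.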


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGAA GAAAACTCAT TAACCATATC GCAACTGCTA CAACTGTCGC AACTTTAGCA 
GCTTGTAATC AAACTACCAC TCAATCTGTT TCTACTGATA GCTTACCTTC TGTAAAATGG
AGAATGACCA CAAGTTGGCC TCGCTCCCTA GATACAATTT TTGGTGGGGC ACAAACAATA
TGCGATCGCG TCGCAGCTAT GACCAACGGC AGATTTACCA TTACTTCCTA TTCTGCAGGA
GAAATAGTCG GAGGTTTAGA AGTTCTAGAC GCTGTCCAAC AAGGTACAGT ACAATGTGGC
CATACTGCAA GTTACTACTA TGTAGGCAAA AACTCAGCCC TTGCCTTTGG TACCTCTGTC
CCCTTCGGAT TAACGGCCCA ACAACAAAAT GCTTGGTATT ATCATGGTGG AGGACTAGAA
ATCATGCATA AGCTGTACTC CGACTTTAAT ATTATTAATT TTCCTGCTGG TAACAGTGGA
GTCCAGATGG GAGGTTGGTT CAGACAAGAA ATTAACACTG TTAGTGATCT AAATGGTTTA
AGTATGCGCA TCCCTGGTTT CGGTGGTGAG GTCATGAAAA AATTAGGTGT AAATGCTCAA
GTATTACCTG GTGGTGAAAT TTATTTAGCT TTAGAACGAG GTGCCCTTGA TGCTGCGGAG
TGGGTAGGTC CTTATGATGA TCAGAAATTA GGCTTACAAA AGGCAGCAAA ATATTATTAT
TATCCTGGTT GGTGGGAACC CGGTCCTACT TTTGAGGTGC AAATTAATCT CAATGAATGG
AATAAACTAC CAAAGGAATA TCAAGAAGTT TTGAAAACTG TTGCATACCA AGCTAACATT
AGTATGCTTG CCCAATATGA TGCTTTAAAC GGAGTAGCAT TAGCAGAATT AATAGCAGGC
GGTACAGAGT TACGCCCCTA CAGTCAAGAA ATTTTACAGG CAGCACAGAA AGCTGCTGTT
GAATTTTACG AAGAAAAAGC TACTCAGGAT ACAACTTTTA AAGAAGTCTA TGAACAGTGG
AAAAAGTTCC GAGAACAGAT TTATAAATGG AATACTATTA ATGAATTGAG TTTTGCTCAG
TTTACCTTCA ATAACTCCCT AAAGTAA
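
The gene sequence is displayed in space-separated groups of ten bases, so it needs to be joined before its length or GC content can be compared with the record. A minimal sketch follows; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as a demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, then upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used as a small demonstration.
first_line = "ATGAAACGAA GAAAACTCAT TAACCATATC GCAACTGCTA CAACTGTCGC AACTTTAGCA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block to `clean_sequence` in the same way should give a string whose length can be compared with the Gene Length field.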
 
Protein sequence
MKRRKLINHI ATATTVATLA ACNQTTTQSV STDSLPSVKW RMTTSWPRSL DTIFGGAQTI 
CDRVAAMTNG RFTITSYSAG EIVGGLEVLD AVQQGTVQCG HTASYYYVGK NSALAFGTSV
PFGLTAQQQN AWYYHGGGLE IMHKLYSDFN IINFPAGNSG VQMGGWFRQE INTVSDLNGL
SMRIPGFGGE VMKKLGVNAQ VLPGGEIYLA LERGALDAAE WVGPYDDQKL GLQKAAKYYY
YPGWWEPGPT FEVQINLNEW NKLPKEYQEV LKTVAYQANI SMLAQYDALN GVALAELIAG
GTELRPYSQE ILQAAQKAAV EFYEEKATQD TTFKEVYEQW KKFREQIYKW NTINELSFAQ
FTFNNSLK
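
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the conventional TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which this sketch does not model. The `translate` function is a hypothetical helper, not a library API.

```python
from itertools import product

# Codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...);
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and final 27 bases of the gene sequence shown above.
assert translate("ATGAAACGAA GAAAACTCAT TAACCATATC") == "MKRRKLINHI"
assert translate("TTTACCTTCA ATAACTCCCT AAAGTAA") == "FTFNNSLK"
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed above.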