Gene Tery_1324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1324 
Symbol 
ID4242475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2019402 
End bp2021213 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content36% 
IMG OID638106507 
ProductNa+/solute symporter 
Protein accessionYP_721118 
Protein GI113475057 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.54237 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTAA TTGATTGGCT AATTGTTTTA CTATACCTTA TTTTAACTAT GTGGATGGGG 
CTATATCTAT CTCGCAAAGG TGCAAAAAGT TTAGCAGACT TTTTTGTTTC TGGGCGATCG
CTTCCTTGGT GGTTAGCAGG TACAAGTATG GCAGCTACAA CTTTTTCCAT AGATACTCCT
CTTTATATTA CAGGGGTTGT AGCTAACCGA GGAATAGCTG GTAATTGGGA ATGGTGGATT
TTTGCTGTTT CTCATGTAAT TATGATTTAT ATTTTTGCTA AAATGTGGCG ACGCTCGGAA
GTAATTACTG ATGCTGAATT AACTGAAATT CGTTACGGCG GAAATATGGC AGCTGTTCTA
CGAGGAGTAA AAGCTTTTAT CTTTGCTATT CCTATGAATT GTATTGCTAT TGGTTATGCT
ATGTTAGCTA TGGTTAAAGT TGTTGAAGCA TTGCAAATTT GGCAAAGTTT AGGTATTGAT
GCTAATGCCA ATAATCTCAA ATTATTAAGT GTGATAGGTG TAAGTATATT TGTATTAATT
TATGCTGGTT TATCTGGTTT ATGGGGAGTA GTTGCTACAG ATTTTTTCCA ATTTTTTCTA
GCACTATTTG GGGCAATTAT TGTAGCGTTT TTTGCTGTTA ATAGTGTGGG TGGAATACAA
GAATTAATTA TTAAAATTCC TCAAGTTATC CCAGACAAAG ATATTTTATC ATTTGCACCT
TTTACTATTG GTGGAAATGG TAACTCTTGG ATTACTTTTA GCAAAACAGC AGGCATAACT
GCGAGTACAT TTTTTGCTTA TATTGCTATT CAATGGTGGT CTTGGCGCCG CAGTGATGGA
GGAGGAGAAT TTGTACAAAG ATTTGCTGCT GCAAAAAATG AAACAGAAGC AGAAAAAGCA
GCCTGGTTAT TTAATATAAT GCACTATGTT ATTAGAACCT GGCCTTGGAT TTTAGTTGCT
TTAGCTTCTA TTATAATTTA CCCCGATTTA CAAGATAGAG AATTGGGTTA TCCTAAATTA
ATGATTGATT TTCTTTCTCC AGGAATTTTA GGCTTAGTTG TTGCTTCTTT AGTGGCAGCA
TTTATGAGTA CAGTTTCTAC TTCTATTAAT TGGGGCGCTT CTTTTATTAC TAATGATTTA
TATCGAAGAT TTATTAAACC TGATGCTACT CAGGCAGAAT TAGTTTTGGT AGGACGACTT
TCTTCTCTAT TAGTAACAGT TTTAGGAGGA GTTGCTGCAT TTTTTGCTAA AGATGTGGCA
ACAGTATTTA GGTTAGTAAT AGCTATTGGA ACAGGTTCGG GGTTGGTATT AGTTTTACGT
TGGTTTTGGT GGCGAGTAAA TGCAGCAGCA GAATTAACTG CTATTGTCGC TAGCTTTTTT
GTGGGGATGG CAACTAGTTT ATTTCCTGCC TTCAAAATAG AAGATTTTGG TTTGCGAATT
ATCTTTATTA CTGTAACTGT AACAGTTTTG TGGGTGGTAG CAATGTTGAT TACACCGCAA
GAGTCTGATG CTACTTTAGA AGACTTTTAT CGGCGATCGC TCCCCGGTGG TCCTGGTTGG
CAACGGCAAA GAGTAAGCAC TGGTTTAGCT CCTGCACAAA ATTTGGGTAA GGATTTACAA
AAGGTTCTGG CATCTATATT ATTATTGTTT GGAGCGTTGT TGGCGACTGG TGGTTTCCTG
TTACTTAAGC CAACTATTGG TTGGATATTT ATGATTATTG CTGTTTTCAG CGGGATGTGG
TTGCGACAAC TCAATAAGTC TAAAATTTTA CCTATGCCAA GACCAGGGTT AGATGATGAA
GATTTGTTAT AA
 
Protein sequence
MKLIDWLIVL LYLILTMWMG LYLSRKGAKS LADFFVSGRS LPWWLAGTSM AATTFSIDTP 
LYITGVVANR GIAGNWEWWI FAVSHVIMIY IFAKMWRRSE VITDAELTEI RYGGNMAAVL
RGVKAFIFAI PMNCIAIGYA MLAMVKVVEA LQIWQSLGID ANANNLKLLS VIGVSIFVLI
YAGLSGLWGV VATDFFQFFL ALFGAIIVAF FAVNSVGGIQ ELIIKIPQVI PDKDILSFAP
FTIGGNGNSW ITFSKTAGIT ASTFFAYIAI QWWSWRRSDG GGEFVQRFAA AKNETEAEKA
AWLFNIMHYV IRTWPWILVA LASIIIYPDL QDRELGYPKL MIDFLSPGIL GLVVASLVAA
FMSTVSTSIN WGASFITNDL YRRFIKPDAT QAELVLVGRL SSLLVTVLGG VAAFFAKDVA
TVFRLVIAIG TGSGLVLVLR WFWWRVNAAA ELTAIVASFF VGMATSLFPA FKIEDFGLRI
IFITVTVTVL WVVAMLITPQ ESDATLEDFY RRSLPGGPGW QRQRVSTGLA PAQNLGKDLQ
KVLASILLLF GALLATGGFL LLKPTIGWIF MIIAVFSGMW LRQLNKSKIL PMPRPGLDDE
DLL