Gene Tery_2845 details

Gene Information

Locus tag: Tery_2845
Symbol:
ID: 4245148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4425850
End bp: 4427274
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 34%
IMG OID: 638107895
Product: Na+/solute symporter
Protein accession: YP_722492
Protein GI: 113476431
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG0591] Na+/proline symporter
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAAA ACCTAATTCC CTACTTATTT ATTAGTGCAT ATTTACTCCT GACTTTAGTC 
ATCGGAATTA TTGGTTATCG TAGCCAAAAA AATACACCAG AAGATTACTT TCTAGCAGAG
CGAAAAATCG GTTCCATAGT TCTATTTTTT ACCCTCATTG CTACCAACTT TAGTGCCTTT
GCTTTTCTCG GACTTTCTGG TGCTGGTTAT CGTATTGGTA TAAGTTATTA TCCGATGATA
GGATTTGGTA CGGCCCTTGT CGGAATTACA TTTTATTTTA TCGGTTATAA AGTTTGGCTT
TTAGGTAAAG AAAAAGGTTT AATTACTCCC TCAGAATTAA TCGCAAATTG CCTCCCAAGT
CAACCACTAA AACTAATATT TTTAGCCGTG ATGGTAATTT TTACCATACC TTATTTAACC
CTACAACCAA TTGGTGGGGG TTATATGATT GAAAATTTAA CTAATGGGCA AATTCCCTAT
TTTTGGGGAG CAGCTTTTTT AACTTTCATC ATTGTTTTAT ATGTGTTTAT CGGGGGAATG
AAAAGTGTAG CTTTAACCGA TGTTTTACAA GGAGTATTAA TGTTGATTTT AATGATAGTT
GCAGTAGTGA CTATTTCCCA AAGTCTCGGT GGCTTGACTG AAGCTAATCA AACTGTTCAT
AAACTCAAAC CAGAGCTTTT TTCTCGTTCT GGTACCAATG ATTTTTTCAC TCCCCACAAA
TGGTTTAGTT ATATGTTGCT TTGGGGTTTA AGTGTACCGA TGTTCCCCCA AATGTTTATG
CGCTTTTATA CTCCTAAAAA TCCTAATTCC CTCAAACTTT CTGCTAGTTT ATATCCATTA
ATTACTTGCA TAATGTTTAT TTGCCCTGTT TTAATTGGAA TGTGGGGTCA TATTAATTTT
CCTGATTTAG TTGGAAAAGA AGCAGATAAA ATTTTTCCCA TGATGTTAGC TGAATATACC
TCAACTACCA TGGCTTCATT AGTAATGGTA GGAGCTTTAG CAGCGTTTAT GTCAACCCTT
GATTCTCAAC TTTTAGCATT AAGTTCTATG ATTACTAGGG ATATTTATAT TGTTTATATT
CGCCCCCAAG CAACCCTGAC AGAACAAACT TTTGTGGGCA AAATTTTAAT TGTTATTTTA
GCAATAATTG GTTTGATTTT AGCAGCAAAT CCTCCGGCAA CAATTGCAGC GATCGCTACT
CAAGCTTTTA CAGGTTTAGC AGTATTATTT CCCACAGTTA TTGCCGCTTT ATATGGGAAA
AATATTTCAC CTTTGAGTTG TATAATTTCC ATTATTTTTG GAGAAATAGC AGTTATTGGT
TTTCAGGTAA ATATTATTCC TAAAAGTTGG GCTTTAGGAT TTCTGCCAGT AGTGCCGATA
GTGTTAATTT GTAGTTTAAT TATTGTCGTT TTTTTGAGGA AGTAG
 
Protein sequence
MIKNLIPYLF ISAYLLLTLV IGIIGYRSQK NTPEDYFLAE RKIGSIVLFF TLIATNFSAF 
AFLGLSGAGY RIGISYYPMI GFGTALVGIT FYFIGYKVWL LGKEKGLITP SELIANCLPS
QPLKLIFLAV MVIFTIPYLT LQPIGGGYMI ENLTNGQIPY FWGAAFLTFI IVLYVFIGGM
KSVALTDVLQ GVLMLILMIV AVVTISQSLG GLTEANQTVH KLKPELFSRS GTNDFFTPHK
WFSYMLLWGL SVPMFPQMFM RFYTPKNPNS LKLSASLYPL ITCIMFICPV LIGMWGHINF
PDLVGKEADK IFPMMLAEYT STTMASLVMV GALAAFMSTL DSQLLALSSM ITRDIYIVYI
RPQATLTEQT FVGKILIVIL AIIGLILAAN PPATIAAIAT QAFTGLAVLF PTVIAALYGK
NISPLSCIIS IIFGEIAVIG FQVNIIPKSW ALGFLPVVPI VLICSLIIVV FLRK
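
The sequences above are displayed in space-separated blocks of 10 residues, and the record gives a gene length of 1425 bp for a 474 aa protein. The following is a minimal Python sketch, not part of the original record, of how such a record could be sanity-checked. The helper names (clean_sequence, gc_fraction, cds_length_consistent) are hypothetical, and the length check assumes an unspliced CDS whose final codon is a stop codon.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display sequences in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string (0.0 if empty)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def cds_length_consistent(gene_bp: int, protein_aa: int) -> bool:
    """True when the gene length equals three bases per residue plus one stop codon."""
    return gene_bp == 3 * (protein_aa + 1)


# Gene and protein lengths as stated in the record.
print(cds_length_consistent(1425, 474))

# Short demo on the first two and last two displayed blocks of the gene sequence.
demo = clean_sequence("ATGATCAAAA ACCTAATTCC\nTTTTTGAGGA AGTAG")
print(len(demo), round(gc_fraction(demo), 2))
```

Passing the full gene sequence text to clean_sequence and then to gc_fraction would give a value that can be compared with the GC content listed in the record.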