Gene Tery_4787 details


Gene Information

Locus tag: Tery_4787
Symbol:
ID: 4246441
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7357523
End bp: 7359022
Gene Length: 1500 bp
Protein Length: 499 aa
Translation table: 11
GC content: 39%
IMG OID: 638109635
Product: amino acid carrier protein
Protein accession: YP_724211
Protein GI: 113478150
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1115] Na+/alanine symporter
TIGRFAM ID: [TIGR00835] amino acid carrier protein
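
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that includes its terminal stop codon; both are assumptions about how the record was built, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 7357523
END_BP = 7359022
GENE_LENGTH_BP = 1500
PROTEIN_LENGTH_AA = 499


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 7359022 - 7357523 + 1 gives 1500 bp, and 1500 / 3 - 1 gives 499 aa, which agrees with the listed values.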


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.617016
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGAC AAACTTGGCT ATATATACTC CTATTGATAT TAACACCAAC TGTTGTTTTA 
GCAACACAAC CAGAACAAGG AGATAGTCTC ATAAACATCA TCGATCAAGC TTTTTCGAGT
TTTGTAGATG TAATATTTAA CATCCTCTTC TTTAGTATCG GTGGTTTTCC TTTAATTATT
TTATGGCTAA TTATTGGCGC ATTTTTTTTT ACTATTAGGA TGCAATTTAT TAATATTCGT
GCTTTTACAC ACGCCATTGA GATCCTTAGA GGTAAATATG ATGATCCCGA AGCTCAAGGA
GAAGTTTCAC ATTTTCAAGC TTTAGCAACA GCATTATCAG GAACAGTAGG ATTAGGAAAT
ATCGCTGGTG TAGCTGTAGC AGTGAGTATG GGGGGAGCTG GTGCTGTCTT CTGGATGACG
ATCGCTGGTT TTTTTGGGAT GACTAGTAAA TTTATAGAAT GTACCCTGGC TCAAAAATAT
CGCATTGTTA AACCTGATGG CACTATTTCT GGTGGGCCGA TGCGCTACCT ATCCGCAGGT
TTAGCAGAAA TGGGGCAGGG AACCTTAGGC AAAGTTCTCG CTGTATTATT TTCTATCTTC
TGTATTTCTG CTGCTTTTGG TGGTGCTAAT ATGTTTCAGG CAAACCAGTC TTATGGCGCC
GTTTCAAATG TATTACCAGG TTTACCTAGT TGGGTTTATG GCTTAGTGCT AGTAGTTTTG
GTCGGATTAG TTATTATTGG TGGTATTCAG CGTATTGGTA TGGTGGCAGG TACTCTTGTA
CCTTTAATGT GTCTGCTCTA TGTTTTGGCT TGCCTATTTA TTCTTCTGGC TAATTTTACC
CAAATTCCAG GAGCGATCGC TACTATTATT TCTGGTGCTT TTGCCCCCCA AGCAGTAGAG
GGTGGGATTA CTGGTGTAAT TATTCAGGGA TTTCAACGTT CGGCTTTTTC TAATGAAGCT
GGTGTCGGTT CCGCAGCGAT CGCTCACTCT GCTGCTAGAA CTGATGAACC TATTCGTGAA
GGCTTAGTCG CCCTGTTAGA ACCATTTATT GATACTATTG TGGTTTGCAA TATGACCGCT
TTGGTAATTG TTATTACTAA GGTATATAAT GCTGAAGAAT TCTCTGCTTT GCGGGAAGCA
AATAAAGGGG CAGAATTAAC TTCTGCAGCT TTTGGTACTG TTTTGGCATG GTTTCCAGTT
CTATTAGCGA TCGCTGTTTT CTGTTTTGCG TTTTCTACCA TGATTTCTTG GAGTTACTAC
GGCGAACGTT GTTGGGATTA TTTATCTGAT GGTAAGGGAT TAATTATTTA CAAAATATTG
TTTTTGATTG CTACTTTTGT CGGTTCAGTA TCTAACCCAT CCTCTGTAAT TAATTTTAGT
GATGCTACTC TACTTTCAAT GGCTTTTCCT AATATTTTGG GTGGGTATTT TTTGTGTAGT
CCAGTAGCAA AGGATTTGCA AAATTATATG GAGCGTCTTA GAACAGGAGC CTTGACTTAA
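
A GC content figure like the one listed for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string with the same space-separated blocks of ten bases; `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence only; paste all lines for the full value.
    first_line = (
        "ATGAAAAGAC AAACTTGGCT ATATATACTC "
        "CTATTGATAT TAACACCAAC TGTTGTTTTA"
    )
    print(f"{gc_content(first_line):.1f}%")
```

Running it over the complete gene sequence, rather than the single line used here, is what would be compared against the reported percentage.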
 
Protein sequence
MKRQTWLYIL LLILTPTVVL ATQPEQGDSL INIIDQAFSS FVDVIFNILF FSIGGFPLII 
LWLIIGAFFF TIRMQFINIR AFTHAIEILR GKYDDPEAQG EVSHFQALAT ALSGTVGLGN
IAGVAVAVSM GGAGAVFWMT IAGFFGMTSK FIECTLAQKY RIVKPDGTIS GGPMRYLSAG
LAEMGQGTLG KVLAVLFSIF CISAAFGGAN MFQANQSYGA VSNVLPGLPS WVYGLVLVVL
VGLVIIGGIQ RIGMVAGTLV PLMCLLYVLA CLFILLANFT QIPGAIATII SGAFAPQAVE
GGITGVIIQG FQRSAFSNEA GVGSAAIAHS AARTDEPIRE GLVALLEPFI DTIVVCNMTA
LVIVITKVYN AEEFSALREA NKGAELTSAA FGTVLAWFPV LLAIAVFCFA FSTMISWSYY
GERCWDYLSD GKGLIIYKIL FLIATFVGSV SNPSSVINFS DATLLSMAFP NILGGYFLCS
PVAKDLQNYM ERLRTGALT
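
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, so the standard assignments are used here. The `translate` function and its handling of unknown codons (`X`) are illustrative assumptions.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE.get(bases[i:i + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGAAAAGAC AAACTTGGCT ATATATACTC"))  # MKRQTWLYIL
```

The output for the first 30 bases matches the first ten residues of the protein sequence; the same call on the full gene sequence can be compared against the full 499-residue listing.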