Gene Tery_4986 details

Gene Information

Locus tag: Tery_4986
Symbol:
ID: 4246641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 7622202
End bp: 7623740
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 35%
IMG OID: 638109797
Product: TRAP transporter solute receptor TAXI family protein
Protein accession: YP_724373
Protein GI: 113478312
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID: [TIGR02122] TRAP transporter solute receptor, TAXI family
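
The listed gene and protein lengths follow from the start and end coordinates. The sketch below is a minimal check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(7622202, 7623740)
print(length, protein_length_aa(length))  # prints: 1539 512
```

Both values agree with the Gene Length and Protein Length fields above.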


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.354924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTAAATA ACATATACTC CCATCACCAA AAAATCAGTA AATTAATAGG AGGAATATTA 
TTATCAGCGA TCACATTAAC TAGCTGTACT CCCCAAAATC AGACCCTCAT ACTCTCAAGC
GGCGTTGAAG GTGGAGCATA CGAGCGTATT GGCAGACAAA TAATACATTC AGCTAGTGAT
GTTGGGAAAA TATCTATTGA CGACAATATA TCCCAGGGTT CACAGGAAAA CCTACAACGC
CTGCGCAATA GAGAAGCAGA TATTGCCCTA GTTCAATTAG ATGTAGCTAG CGAAGCTATG
AAAGCTGGAC AAATACAAGC AGTAGCTGTT CTTGCTAATG AATATTTGCA CCTCATTACC
CTATCTGACT CGGAAATAAA AACTTTTCAG GATCTCAAAG AAAAAAGAGT TAGCTTTAGT
ACTCCTGGCA GCGGTACTTA CTTTACTGCT AAACGACTAT TCAGTGCTAC AAATTTAAAA
ATCATAGAAG AAGAACTTGA TCTAGAACAA GGGTTTAATA AACTTAAACA AAGAGAAATA
GATGCTTTAG TTTATGTCGG TCCCTTGGGA GGCAGTAAAA AAGTGAAAAG CCAACTAACG
AGTCCTCCTA GTCTTAAATT AGTTCCCATA ACGACTTCTT TTATTAACTA TTTAACTATT
CAATTTCCAG AGTCATACCA AAGTACAGTC TTTCCCAAAG GTAGTTATAT GCCTCTATCA
CCAATACCGG AGAAAGATTT ACCGACTATT TCTACTGCTG CTGCTTTAGT AACTCGCCCT
GATGTTAGCA AAGAAAAAAT TGCTCTTTTA ACTTGGTCTT TGATTTCATC CTATCAAAAA
TACTCGTTAT TTGATCCAGA ATTTTCTCAG GACGACCCTG AGTCTTTGTT GAGTAATGGT
TTACTATATC TCCATCCTGG TGCTAAACTA TCATTTGAGC AAGGGGATCC TAGAAAAATT
TGGATGCGTT ACTTTGAGGA AAATCAAGAC TTACAAGCTA GTTTTATTAT TATTTTGTTT
ACAGGTATTC TCGGTTTTAT GCTACGAATG TGGCGTCATA AACAGTGTAA AAAGCTGATT
ATAAATACCC ATTTAGCTCT CAATGAAATA AGTATGTCTG CACAAACAAA TCCTAGTAAA
GCTCTTAAAG AGGTGGAAGA GTTAAGACAC CAACATCGGT TAATGTTGGT TGAAGGAAAT
TTGTCTAGAG AAGCTTATGA AAAACTAGAG CATATGACAC AAGATCTTGC TAATAAGTCT
CGAAAACTAC AAGAAAAACA ACATCGAAAA GATATTCAAA ATACAATAGC ATTAATTGAT
GAGTTTCAAA GTCTTTCAGG AATATGCCAA GAAAAAATGA GAGAAAGACT GGAAACTGCA
GCTAATAAAT ATAGGGAAAT GTTATTATTA AATCAAATTG ATATTCAGAC TTACATTCAT
CTAAACCAGC AGGTTCGTTC CTATCTCAAC CAAAATAATA ATCATTATTA TCAGGTTAAT
AATAATCGTC ATTTTCCTAT AGGTTCAGAG GATAATTGA
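
The gene sequence is printed in blocks of ten bases, so the whitespace must be removed before computing statistics such as the GC content field. The sketch below is one possible way to do that in plain Python; `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a small example.
first_blocks = clean_sequence("ATGTTAAATA ACATATACTC")
print(len(first_blocks), gc_percent(first_blocks))  # prints: 20 20.0
```

Applying the same two functions to the full sequence text would give a value to compare against the GC content field.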
 
Protein sequence
MLNNIYSHHQ KISKLIGGIL LSAITLTSCT PQNQTLILSS GVEGGAYERI GRQIIHSASD 
VGKISIDDNI SQGSQENLQR LRNREADIAL VQLDVASEAM KAGQIQAVAV LANEYLHLIT
LSDSEIKTFQ DLKEKRVSFS TPGSGTYFTA KRLFSATNLK IIEEELDLEQ GFNKLKQREI
DALVYVGPLG GSKKVKSQLT SPPSLKLVPI TTSFINYLTI QFPESYQSTV FPKGSYMPLS
PIPEKDLPTI STAAALVTRP DVSKEKIALL TWSLISSYQK YSLFDPEFSQ DDPESLLSNG
LLYLHPGAKL SFEQGDPRKI WMRYFEENQD LQASFIIILF TGILGFMLRM WRHKQCKKLI
INTHLALNEI SMSAQTNPSK ALKEVEELRH QHRLMLVEGN LSREAYEKLE HMTQDLANKS
RKLQEKQHRK DIQNTIALID EFQSLSGICQ EKMRERLETA ANKYREMLLL NQIDIQTYIH
LNQQVRSYLN QNNNHYYQVN NNRHFPIGSE DN
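
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal pure-Python translator, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 also permits alternative start codons, which this sketch does not model. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence shown above.
print(translate("ATGTTAAATA ACATATACTC CCATCACCAA"))             # prints: MLNNIYSHHQ
print(translate("AATAATCGTC ATTTTCCTAT AGGTTCAGAG GATAATTGA"))   # prints: NNRHFPIGSEDN
```

The two outputs match the first and last residues of the protein sequence listed above; the final line can be translated on its own because the preceding 1500 bases are a multiple of three.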