Gene Tery_2637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2637 
Symbol 
ID4245362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4087746 
End bp4089296 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content31% 
IMG OID638107706 
Producthypothetical protein 
Protein accessionYP_722305 
Protein GI113476244 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.485926 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTC TTAGATATAA AAAGTTTCAA AATTTAGGTT TAGCAGATTT CAAATTTTTA 
TTTAACAGGC CAACACTACC TTATTTAATT ATTGCTTTTG GGGTAGCTAT GAGGTTTATT
CAATATTTAT CTAATCGTTC TCTCTGGGCA GATGAAGCAG TTTTAGCCTT AAATATTGTC
AATCGTTCTT ATTTAGAATT AATGCAACCT TTGGATTATG ATCAAGGTGC GCCCATAGGT
TTTTTAATAG TAGAAAAATT AGCAGTTCAG ATATTAGGTA ATAATGAGTA TTCCCTACGT
TTTTTTCCTT TTATCTGTGG TGTTTGCTCG TTGTTTTTAT TCTATGAATT AGGCAAAAAA
TGGAGTTCTA AATCTGCTAT TATTATTAGT TTGGCTTTAT TTGCTAGCCT ACAATATTTA
GTGTATTACT CAGCAGAAGT CAAACAATAT TCCAGTGATG TAGCGATCGC TCTGCTTTTA
TATGTACTAT TAATACCTTT GCTACAACAA AAATTGCACC GGGTTCAGAT AGTTAAATAT
TCTCTAGTGG GAGTAACTGC TATTTGGTTT TCTCATCCTT CTATCTTTAT TTTGGCAAGT
TTTGGTAGTA GTGCTTTACT AATTAATTTT TGGCAGAAAG AACTCAGTAA AATCAAACAG
TTATTACTCA TTTATTCAGC TTGGGTTTTA AGTTTTGCCA TCTTTTATTT CCTATCTTTG
ATAAATTTAA CGAGCAATGA AACTTTAACG ACTTCTTGGG AGGATGGTTT CCCTACTTCC
CCATTGGATA TTATTTGGAT GCTAGACGCT TTCGGTAAAT TTTTCTATAA ACCCTTAGGC
TTTAGCAAAT GGGTTGATGG ATTAGCCATT GTCGCTTTTT TAGTAGGTTG TATTTCCTGT
TGGTTGAGCA GAAAAAAAAT TTTGCTGCTC CTACTTTCTC CATTGTTGAT GACTTTTTTA
GCATCCTTTT TACATCAATA CCCATTTCGG AGTCGTTTGG TTCTATTTCT CACACCATTT
GTAATTTTTC TCATAGCAGA AGGCGGAAGT TATATTTTGA CAAAATCTAA ATTTAGACCA
ATTAAAATTA TAACTATTTT CTTGATTATT TTATTACTCA GACAACCTTT AGTAAAAGCC
ATTAAATTAA TAGAAAAACC TCTCAATTTA TCAGAAATAA AACCTGTGTT GAGCTATATC
AAAAAAAATC AACAACCAGG AGATATTTTG TATGTCTATC AACGAGGAAT ATATCAGTTT
CAGTATTATG CAGAAAAATA TGGTTATCAA GAAGGTGACT ATATTATTGG TGTGGATGAT
TTAGATAAGT TTGATGGTCA AGAATTATCA ATTACTGAGA TGACAAGATA TGAAAAAGAC
TTAGACAAAC TGCGGGGGAA TGAAAGAGTA TGGTTATTAT TTTCTCATAC TCATATTCCA
GCAGAAAGAA GATTTTTAAA CTATTATTTA AATGAAATTG GTCTCAGAAT AGATACTTTT
GAAAAACCTG GATCTTATGT ATATTTATAC GACATGAGTT ACCGAAATTA G
 
Protein sequence
MKILRYKKFQ NLGLADFKFL FNRPTLPYLI IAFGVAMRFI QYLSNRSLWA DEAVLALNIV 
NRSYLELMQP LDYDQGAPIG FLIVEKLAVQ ILGNNEYSLR FFPFICGVCS LFLFYELGKK
WSSKSAIIIS LALFASLQYL VYYSAEVKQY SSDVAIALLL YVLLIPLLQQ KLHRVQIVKY
SLVGVTAIWF SHPSIFILAS FGSSALLINF WQKELSKIKQ LLLIYSAWVL SFAIFYFLSL
INLTSNETLT TSWEDGFPTS PLDIIWMLDA FGKFFYKPLG FSKWVDGLAI VAFLVGCISC
WLSRKKILLL LLSPLLMTFL ASFLHQYPFR SRLVLFLTPF VIFLIAEGGS YILTKSKFRP
IKIITIFLII LLLRQPLVKA IKLIEKPLNL SEIKPVLSYI KKNQQPGDIL YVYQRGIYQF
QYYAEKYGYQ EGDYIIGVDD LDKFDGQELS ITEMTRYEKD LDKLRGNERV WLLFSHTHIP
AERRFLNYYL NEIGLRIDTF EKPGSYVYLY DMSYRN