Gene Tery_4362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4362 
Symbol 
ID4246015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6723030 
End bp6725207 
Gene Length2178 bp 
Protein Length725 aa 
Translation table11 
GC content39% 
IMG OID638109250 
Producttetratricopeptide TPR_2 
Protein accessionYP_723827 
Protein GI113477766 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.714509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAC CACTAACATT AGAACTATAT AGAAAAATAG GGATTGCATA CCATTCGCTA 
GGAAAATTTG AACGCTCCAT TGAATATTTT CAGCAACAAT TGATTATTGC TCGCGAAATA
AAAGATCGGC AATCTGAAAG TCAAGCATTA GGTAATTTAG GAATTGTTTA TCAGTCACTA
GGGCATTTTG AACGCTCCAT TGATTCTTTT CAACAACAAT TGCCGATTAC TCGTGACATA
AAAGATCGGC AATCTGAAAG TCAAGCATTA GGTAATTTAG GAATTGTTTA TCAGTCACTA
GGGCATTTTG AACGCTCCAT TAATTATTTT CAACAACAAT TGGTGATTAC TCGCGAAATA
AAAGATCGGC AATCAGAAAG TAAAGCCTTA GGTAATTTAG GAATTGTTTA TCAGTCACTA
GGGCATTTTG AACGCTCCAT TGATTCTTTT CAACAACAAT TGGCGATTAC TCAAGAAACA
AAAGATAGGC AATCAGAAAG TAAAGCCTTA GGTAATTTAG GAATTGTTTA TCAGTCACTA
GGGCATTTTG AACGCTCCAT TAATTATTTT CAACAACAAT TGGCGATTAC TCGCGAAATA
AAAGATCGGC AATCAGAAAG TAAAGCCTTA GGTAATTTAG GCATTTGTTA TGAAAATCAA
GGGCAATATA GCAAAGCCGA ACCCTTGTAT CTCGAAGCAT TAAAGATGAA CAAGCAACTG
TTAGGGTCAA CACATCCGGA TGTGGCCTCT AATCTTAATA ACCTGGCTGG TCTCTACTAT
TCCCAAGGGC GATATAGCGA AGCCGAACCC TTGTATCTCG AAGCATTAAA GATGAACAAG
CAACTGTTAG GAGCAACACA TCCAGAAATT GCTTCTAATC TTAACAACCT GGCTGGTCTC
TACTATTCCC AAGGGCGATA TAGCGAAGCC GAACCCTTGT ATCTCGAAGC ATTAAAGATG
AACAAGCAAC TGTTAGGAGC AACACATCCA GAAATTGCTT CTAATCTTAA CAACCTGGCT
ATTCTTTATC GTTCTCAAGG GCGATATAGT GAAGCCGAAC CTTTGTACAA ACAAGCGATA
GAAATTAATA AAATCGCCCT ACCAGTAAAT CATCCCTCTC GTGCTTCGAG TCTCAACAAC
TTAGCTGGTC TTTACTCTAA CCAAGAGCGA TATAGTGAAG CCGAACCTTT GTACAAACAA
GCGATAGAAA TTAATAAAAT CGCCTTACCA GCAAATCATC CCTTTCTTGC TTCGAGTCTC
AACAACTTAG CTAGTCTTTA CTTTAACCAA GGGCGATATA GCGAAGCCGA ACCTTTGTAC
AAACAAGCGA TAGAAATTAA TAAAATTGCC CTATCAGCAA ATCATCCCTC TCTTGCTTTC
AATCTCAACA ACTTAGCTGG TCTTTACTCT AACCAAGGGC GATATAGCGA AGCCGAAGCT
TTGTACAAAC AAGCGATAGA AATTAATAAA ATTGCCCTAC CAGCAAATCA TCCCTCTCTT
GCTTCGAGTC TCGAGAACTT AGCTGCTCTT TACTTTAACC AAGGGCGATA CAGCGAAGCC
GAAGCTTTGT ATAAACAAGC GATAGAAATT AATAAAATCG CCTTACCAGA AAATCATCCC
TCTCTTGCTT CGAGTCTCGA TAACTTAGCT GCTCTTTACT TTAACCAAGG GCGATACAGC
GAAGCCGAAG CTTTGTACAA ACAAGCGATA GAAATTAATA AAATTGCCCT ACCAGCAAAT
CATCCCTCTC TTGCTTCGAG TCTCGATAAC TTAGCTACTC TTTACTTTAA CCAAGGGCGA
TACAGCGAAG CCGAAGCTTT GTACAAACAA GCGATAGAAA TTAATAAAAT TGCCCTACCA
GCAAATCATC CCTCTCTTGC TTCAAGCTTC ATCAACTTAG CTGGTCTTTA CTCTAACCAA
GGGCGATATA GCGAATTTGA AGATACAATA GCCGCTCTCA GAGAAGACTT GAAAACACGA
AATCATTTAA GTAATTTTTG TAAAATAGTC GAAAATTATC TCCAGGATAC AGACTTAAGC
ACATTTGTAG AAAAATATTA TTCTGAGATT GCAGAATCAG GCTACAATAG AGAAATTGAC
GCCCTTGTGA ATAACCTGGA CCGACATGGG CATATAGAAT TAGCACTCAA CTTATTAGAA
TCAATAAAAA AACAGTAA
 
Protein sequence
MPEPLTLELY RKIGIAYHSL GKFERSIEYF QQQLIIAREI KDRQSESQAL GNLGIVYQSL 
GHFERSIDSF QQQLPITRDI KDRQSESQAL GNLGIVYQSL GHFERSINYF QQQLVITREI
KDRQSESKAL GNLGIVYQSL GHFERSIDSF QQQLAITQET KDRQSESKAL GNLGIVYQSL
GHFERSINYF QQQLAITREI KDRQSESKAL GNLGICYENQ GQYSKAEPLY LEALKMNKQL
LGSTHPDVAS NLNNLAGLYY SQGRYSEAEP LYLEALKMNK QLLGATHPEI ASNLNNLAGL
YYSQGRYSEA EPLYLEALKM NKQLLGATHP EIASNLNNLA ILYRSQGRYS EAEPLYKQAI
EINKIALPVN HPSRASSLNN LAGLYSNQER YSEAEPLYKQ AIEINKIALP ANHPFLASSL
NNLASLYFNQ GRYSEAEPLY KQAIEINKIA LSANHPSLAF NLNNLAGLYS NQGRYSEAEA
LYKQAIEINK IALPANHPSL ASSLENLAAL YFNQGRYSEA EALYKQAIEI NKIALPENHP
SLASSLDNLA ALYFNQGRYS EAEALYKQAI EINKIALPAN HPSLASSLDN LATLYFNQGR
YSEAEALYKQ AIEINKIALP ANHPSLASSF INLAGLYSNQ GRYSEFEDTI AALREDLKTR
NHLSNFCKIV ENYLQDTDLS TFVEKYYSEI AESGYNREID ALVNNLDRHG HIELALNLLE
SIKKQ