Gene Tery_2349 details


Gene Information

Locus tag: Tery_2349
Symbol:
ID: 4245231
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3628413
End bp: 3630362
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 42%
IMG OID: 638107442
Product: TPR repeat-containing protein
Protein accession: YP_722042
Protein GI: 113475981
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
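The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3628413, 3630362)
print(length, "bp")                      # gene length from the coordinates
print(protein_length_aa(length), "aa")   # protein length from the gene length
```

Under those assumptions the computed values agree with the Gene Length (1950 bp) and Protein Length (649 aa) fields in the record.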


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0186841
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACATA TTACTAAACT AACTGTTGGG TTAATTACTA TTGCTCTGGT CAGCTGCAAT 
AAATATGAAT CACAAATAAG TACAGATGTT TCTACTAAAA TTTCTAGACC TGATCCAATT
ATTACTCCTA GACAAGACAA TGCTCCTTTC AACCAAATTT ATATATATAG ATTAGCTGAA
AATATTACTG TAGAAGTTGT TGCTCGTAGT ACTGTTTCTG GAGGTGACCC TTTTAGCTCG
TCAGGTTCGG GGGTTATTAT TGGTAAACAA GGTTCTACTT ATTATGTGCT CACTGCCAAG
CATATTTTTC AGTACCAAGA TGACTATCGG GTAGTTGTTC GCAGTAAAAA ACCGGGGGAG
GATGCGGAAA TTCTGAAGCT AGAAATTATC TACCGTTATC CAGATGCAGA TTTGGCTGTG
TTTAAGTTTG CTAGTGTGAA GCAATATAAG GTGGCTGAAG TAGGGGAAGC GAGTCAGTTA
AGGAAAAATA GTGAAGTTTA TGTGGGGGGT TGGCCTGGGG CTGAGAATAG GGAAGGTTTT
CAGTTCACTC CGGCGAAGGT GACTAACCCA CGGGCAGGAG ATCTTTTAAA TTATGAGCCT
ACTGTACCTG GTGAGGGTGT ATATCCTGGT ATGAGTGGGG GGGCGGTTTT GAATAAGGCG
GGGCAGTTGA TGGGTATTCA TGTGGGGTTG ACAAAGGCTG GAGATGGGGA GGGGGTTTTG
GTTTCGACTT TTTTGCGGGA TATACCACAG CAGGTGAGCA GGGTTTTGGT GAGGTCAACT
TCTGCTGCTT TGCCTTCTCC GATACCTTCC CCTAAAAATC GGGCTGTTGG AAATACTGAT
AATGTTACTC CATCTCAAAT TCAGAATGCC GAAAGTTACT ATGAACAAGG AGATAAACAC
CATGATAGGG AAGAATTTGA ACAGGCGCTC GCTGACTATA ACCAAGCCAT TCAACTTAAT
CCCAAATATG CTGATGCTTA CAACAACAGG GGTATTGTTT ACCGTAAGCA GGGAAAATAT
GATTTGGCGC TCGCTGACTT AAACCAAGCC ATTCAACTTA ATCCCAAATA TGCTGATGCT
TACAAAAACA GGGGTAATGT TTACTATAAC CAGGGAAAAT ATGATTTGGC GCTCGCTGAC
TATAACCAAG CCATTCAACT TAATCCCAAA TATGCTGAAG CTTACAACAA CAGGGGTTTG
GTTTACGATG ACCAAGGAAA ATATGATTTG GCGATCGCTG AGTTTAACCA AGCCATTCAA
CTTAATCCCA AATATGCTTA TGCTTACAAC AACAGGGGTG TGGTTTACGA TGACCAGGGA
AAATATGATT TGGCGCTCGC TGACTATAAC CAAGCCATTC AACTTAATCC CAAATATGCT
GAGGCTTACA ACAACAGGGG TGGTGTTTAC CTTGAGCAGG GAAAATATGA TTTGGCGATC
GCTGACTATA ACCAAGCCAT TCAACTTAAT CCCAAATTGG CTGAAGCTTA CAACAACAGG
GGTGCGGTTT ACCGTAAGCA GGGAAAATAT GATTTGGCGC TCGCTGACTA TAACGAATCC
ATCAGACTAA ATAACCCTCA GCTTTGGCTG CCTTACAACA ACAGGGGTTT GGTTTACAAT
GACCAGAGAA AATATGATTT GGCGCTCGCT GACTATAGCC AAGCCATTCA ACTTAATCCC
AAAGATGCTT ATGCTTACTA CAACAGGGGT AATGTTTACG ATGACCAGGG AAAATATGAC
TTGGCGATCG CTGACTATAG CCAAGCCATT CAACTTAATC CCAAATATGC TAATGCTTAC
TACACCAGGG GTCTGACTAA CAAGGATCAA AGAAATATGG AAAAAGCTAT ATCAGACTTT
GAAAAAGCTG CCGACTTATA CAAACAACAA GGAAATCAAA CATGGTATCA AAACTCTCTA
GATCAACTCA AAGAATTACG AGGATCTTAA
 
Protein sequence
MKHITKLTVG LITIALVSCN KYESQISTDV STKISRPDPI ITPRQDNAPF NQIYIYRLAE 
NITVEVVARS TVSGGDPFSS SGSGVIIGKQ GSTYYVLTAK HIFQYQDDYR VVVRSKKPGE
DAEILKLEII YRYPDADLAV FKFASVKQYK VAEVGEASQL RKNSEVYVGG WPGAENREGF
QFTPAKVTNP RAGDLLNYEP TVPGEGVYPG MSGGAVLNKA GQLMGIHVGL TKAGDGEGVL
VSTFLRDIPQ QVSRVLVRST SAALPSPIPS PKNRAVGNTD NVTPSQIQNA ESYYEQGDKH
HDREEFEQAL ADYNQAIQLN PKYADAYNNR GIVYRKQGKY DLALADLNQA IQLNPKYADA
YKNRGNVYYN QGKYDLALAD YNQAIQLNPK YAEAYNNRGL VYDDQGKYDL AIAEFNQAIQ
LNPKYAYAYN NRGVVYDDQG KYDLALADYN QAIQLNPKYA EAYNNRGGVY LEQGKYDLAI
ADYNQAIQLN PKLAEAYNNR GAVYRKQGKY DLALADYNES IRLNNPQLWL PYNNRGLVYN
DQRKYDLALA DYSQAIQLNP KDAYAYYNRG NVYDDQGKYD LAIADYSQAI QLNPKYANAY
YTRGLTNKDQ RNMEKAISDF EKAADLYKQQ GNQTWYQNSL DQLKELRGS
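The sequences above are printed in blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the listed protein. It assumes the sense strand is shown 5' to 3' starting at the start codon; all function names are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so the standard assignments are used here.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# which translation table 11 shares); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the listing above.
print(translate("ATGAAACATA TTACTAAACT AACTGTTGGG"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to check that the two listings agree; `gc_percent` can likewise be compared with the GC content field.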