Gene Tery_1822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1822 
Symbol 
ID4244536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2778826 
End bp2781759 
Gene Length2934 bp 
Protein Length977 aa 
Translation table11 
GC content40% 
IMG OID638106944 
ProductNB-ARC 
Protein accessionYP_721552 
Protein GI113475491 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.244157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.101864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGGTT TCTTTTTGGT ATTTGACTGC AAAAATTATC CTCCTCAAGG GGATGTTGTT 
TTTGTTCATG GTTTGGCTGG TCATCCTTGG GGTACATGGC ATCCTCAAAA TAAAAAGGAT
AAAGAAGATG TTGACTTTTT GCTGTGTTGG TTGGGTGAAG AACTACAAGC ACATGGAATA
GATGTTAATG TCTGGAGTTT TGGATATGAT GCTCCTGGTT TTCAATATTT TGGTCAGGGA
ATGCCGCGTT TTGATCTGGC TAGTAATTTA CTGGAGTATC TTCAGGTTTA TGATATTGGC
AAGACATCCA CTAGTTTCCC CAAACGCCCA TTGATATTTG TAACTCACAG TTTGGGTGGT
TTGGTGGTGA AGGAGGTAAT TCGTACAGCT CAAAGTTTTC CTGAGTATCT AGCTATTGTT
AAACAGGTTA AAGGAATTGT ATTTTTGTCT ACTCCACACA CGGGAACTCA TTTGGCTAAT
TTAATTGATC ACGTCGGTTT TTTGACTAGA CCTACAGTGA ATGTTGAAGA GTTAAAAGAA
CATTCCCCTC AACTGCGAGA TCTTAATGAA TGGTATCGAC AAAATGTTGG CAAGTTGGAA
ATTAAAACTA AGGTTTTTTA TGAAACAAAG TCTCTGAATG GGGTTTTGGT TGTGAATGAG
GATAGCGCTA ACCCTGGTAT TCACGATGTT AAACCTATTG CTGTTTCAGC AGAAGACCAC
AACTCTATTG CTAAACCTGG AAAAAATGAT TTGGTTTATC TTTCGGTGAG AAAGTTTTGT
CAAGATATTT TTGCTTTAGA GGAAAGCACT TCTTCTCAAT ATCTTCACCA AAAATACTAT
ATTCCTGGGG AGGCTAGCTT AGAGAATAAA GTTTTTGTAG GTAGGGAAAA AGAATTAGCT
GATGTCGACA AGTTACTCAA AAATTATCAG CGGGTTTCTA TTGTTTCTGT TTCTGGAATG
GGTGGTGTTG GTAAAACAGA ACTTTGCCGC CGATATGCTT ATGCTCATAA GTCTGCTTAC
CCTGGAGGTA TTTGTTGGTT GGAAGCGCCG ACAGAAAATG CAGGTATTCA GATACTTAGG
TTTGCTCAGA ATAATTTCCA GCTTGTTTTC TCTTCAGATC AAGATTTACC GGAAAGGTTG
CGCTACTGTT GGCAAAAGTG GTCTGAGGGG AATACTTTGC TGATCTATAA TGATGTAACT
GACTACAATA CTCAGGCCAA GCCTTTTTTA CCTCCGGACT TGTCTCGGTT TCGGGTGTTG
CTGACCACTC GTAAAAGTTT TGGCTCGGCA TTTCCGGAGT TGCGTCTCGA TGTGTTAAAG
CCTTTGGCGG GGATGAAGTT GTTAAGGTCT ATTTTGGGTA GAAAGAGACT TTTGCTAGAA
CCTAGGAAGG CTAGAGAACT GCTGATTTTT TTGGGTTATT TACCTTTGGG AATTGAATTG
GTGGGGCGAT ATTTGGATGA GTATTGGCAG AGTTTGAATA GGGATGGTTT GGCTTTGACA
AAAATGCTTA AGCGTTTAGA AAGAAAGAGT CTGGAACATC AGGCTATGTC ATCTAATGAG
TTAATTAATT ATCCTTATGG TGTGGCGGAG GCGATCGCTT TGAGTTGGGA GATGTTGGAT
GAAAATACTC AGGAAATTGG GTTGAAGTTA GGTTTATATG CTTTGGCACC TATTCGTTTG
TGGTGGGATG GGATAGAAGA TGATGAAGAG TTGGAAGGTT GGGAAATTGC TCTTGGGAAT
TTGGAAAATC TGCATTTACT GAAGAGTGTT GAGCCTGGTG TTTATATTTT GCATTCTCTA
GTGCGAGAGT TTTTGCAGAT GAAGTTGAAA GAATATCCAA GGGCAGATGA GTTAAAGCGG
GGTATTTGTC AGGTAGTGGC AGAGGCGGCT AGGAATATTC CTGATAACAT TACAGTGGAG
CAAGTTAAGG AAGTTGAGGT TGATATTCCT CATATTACGG AAGTGGCGGC AGTTTTGACT
GAGTATTTGA GTGATGATGA TTTGATTTCT CCTTTTAATG GACTAGGTCG GTTCTATCTA
GGTCAAGCAT TGTACTCACA GGCACAACTT TGGTTAGAGA CAGGTAAAGA AATAGCAGAA
AAACGGCTAG ACAAAGATAA TGCTGACATT GGGAATATCT ACAACAGTCT GGCTTTATTA
TATAAGTCTC AAGGAAAATA CGAAGCTGCA GAACCTTTGT ACCTACAAGC TATTGAAACC
GCAAAAATAG CCCTCCCTGA AAATCATCCA TCTATTGCCA CAGGCCTCAA CAACCTGGCA
AATTTATATT ATTCTCAAGG AAAATACGAA GCAGCAGAAC CATTGTACCT ACAAGCTCTT
GAAATCAAAA AAATAGCCCT CCCTGAAAAT CATCCACAAC GTGCCAGCGG CCTCAACAAC
CTGGCAGGTT TATATTATTC TCAAGGAAAA TACGAAGCAG CAGAACCATT GTACCTACAA
GCTCTTGAAA TCGACAAAAT AGCCCTCCCT GAAAATCATC CACAGTTTGC CACTCACCTC
AACAACCTGG CAAAATTATA TAGATCTCAA GGAAAATACG AAGCTGCAGA ACCATTGTAC
CTACAAGCTC TTGAAATCGA CAAAATAGCC CTCCCTGAAA ATCATCCACA GTTTGCCACT
CACCTCAACA ACCTGGCAAA ATTATATAGA TCTCAAGGAA AATACGAAGC TGCAGAACCA
TTGTACCTAC AAGCTCTTGA AATCAACAAA ATAGCCCTCC CTGAAAATCA TCCAGATATT
GCCACTGACC TCAACAACCT GGCTTTATTA TATGAGTCTC AGGGAAAATA CGAAGCTGCA
GAACCTTTGT ACCTACAAGC ACTGAAGATA TTAAAACAAT CATTAGGAGA AGAACATCCT
AATACTCAAA CAGTTCAGAA AAACTATCAA AATTTCTTAA ATGAGAAAAA ATGA
 
Protein sequence
MLGFFLVFDC KNYPPQGDVV FVHGLAGHPW GTWHPQNKKD KEDVDFLLCW LGEELQAHGI 
DVNVWSFGYD APGFQYFGQG MPRFDLASNL LEYLQVYDIG KTSTSFPKRP LIFVTHSLGG
LVVKEVIRTA QSFPEYLAIV KQVKGIVFLS TPHTGTHLAN LIDHVGFLTR PTVNVEELKE
HSPQLRDLNE WYRQNVGKLE IKTKVFYETK SLNGVLVVNE DSANPGIHDV KPIAVSAEDH
NSIAKPGKND LVYLSVRKFC QDIFALEEST SSQYLHQKYY IPGEASLENK VFVGREKELA
DVDKLLKNYQ RVSIVSVSGM GGVGKTELCR RYAYAHKSAY PGGICWLEAP TENAGIQILR
FAQNNFQLVF SSDQDLPERL RYCWQKWSEG NTLLIYNDVT DYNTQAKPFL PPDLSRFRVL
LTTRKSFGSA FPELRLDVLK PLAGMKLLRS ILGRKRLLLE PRKARELLIF LGYLPLGIEL
VGRYLDEYWQ SLNRDGLALT KMLKRLERKS LEHQAMSSNE LINYPYGVAE AIALSWEMLD
ENTQEIGLKL GLYALAPIRL WWDGIEDDEE LEGWEIALGN LENLHLLKSV EPGVYILHSL
VREFLQMKLK EYPRADELKR GICQVVAEAA RNIPDNITVE QVKEVEVDIP HITEVAAVLT
EYLSDDDLIS PFNGLGRFYL GQALYSQAQL WLETGKEIAE KRLDKDNADI GNIYNSLALL
YKSQGKYEAA EPLYLQAIET AKIALPENHP SIATGLNNLA NLYYSQGKYE AAEPLYLQAL
EIKKIALPEN HPQRASGLNN LAGLYYSQGK YEAAEPLYLQ ALEIDKIALP ENHPQFATHL
NNLAKLYRSQ GKYEAAEPLY LQALEIDKIA LPENHPQFAT HLNNLAKLYR SQGKYEAAEP
LYLQALEINK IALPENHPDI ATDLNNLALL YESQGKYEAA EPLYLQALKI LKQSLGEEHP
NTQTVQKNYQ NFLNEKK