Gene Tery_1587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1587 
Symbol 
ID4242736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2424735 
End bp2427869 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content38% 
IMG OID638106729 
ProductNB-ARC 
Protein accessionYP_721339 
Protein GI113475278 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.257434 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATA ATTTTGAGGA TAAAGTTGAT ATTAAGGAGA CTCATGGTGT TGGTGTTAAT 
TCAGGCAATA TTGTAAATTC AACTTTTGCT AAGACAATTA TTAATGACAC AAAAAATGTT
TCATGGAAGG GTGAGCCAGC AGAGTTTCCT AATAATTTAA ATATATTACG AACTGGTGCT
GTTAAATTTG TTGGTCGAGA TAAAGATATC GAAAATTTAC ATGAACAACT TCAGGAAAAA
GAGCGTGTAT CTATTACAGC AGTAGTTACA GGTATGGCAG GCGTTGGGAA AACAGAATTA
GCACTTCAAT ATTCTCTTTT ATCTGAAAAG GAGTTGAATT ATCCTGGTGG TATTTGTTGG
ATAAATGTTA GGGAAAGAAG TGTGGGAGAA CAGTTATTAA GCTTTGCTCA AACTCAATTA
GGATTATTTC CGGCTGAAGA TTGGAGTTTA GAAGAAAGAA TTAGCTTTTG TTGGTCAAAT
TGGCAACCAC CTGGAGATGT TTTAATTGTT CTAGATGATG TTAATAAATA TGAAGAAATT
GAGCAGTATT TACCTCCACA AAAACAACGT TTTAAATTAT TAATTACTAC TCGTAAATAT
TGGTTGTCAG AATCTTTTTC ACAGTTACGT TTGGAGGTTT TGGATGAAGA TTCTGCTTTA
GAATTATTAG AGGTCTTAAT TGGCAATTCT CGTTTAGGGA CACAAATAGA GGAAGCAAAG
CAACTTTGTG AATGGTTGGG ATATTTACCC TTGGGATTAG AGTTGGTGGG GCGATTTCTG
AAGAGGCGGT CTGAATGGAC ATTGGAAAGA ATGATACAAG AGTTGGAAAA ACAAGCTTTA
AATTTCTCGG TGCTACAGAA CCCACCACAG GGAGAAATGA CAGCACAGCG TGGAGTAGCC
GCTGCTTTTG AATTGAGTTG GAATGAATTA GATGAAAGGG GAAGGTATTT GGGCTGTTTG
TTGAGTCTTT TTGCATTGGC TCCTATACCT TGGAATTTGG TGGAAAAATG TTTGTCTAAA
GATGAAAGTC AGGAGAAAGG AATTATACAA AGGTGGTTTC CTACTTTTTC ACGTTTATGG
TTATTATTGA TGCCTCCAAA AAAAGTTGAT GTATTAGATT CAAGAACTTG GGAAGATATT
AGGGAAGATA CTTTGTTAGA TTTAAATTTA ATTCAAAAGA CAACACAGGG AACTTATGAA
TTGCATCAAT TAGTACGTCG ATATTTTCAA GATAAGTTGA ATGCAATGAA GGAAGTAGAA
CAGTTGAAGT CTCAGTTTTG TCGGGTAATT GTAGGTGCGG CAGAGAAAAT TCCTTATAAC
AATGACATTA CAGTAGAACA AGTTAAAGAA GTTGAGATTG ATATACCTCA TATTACAGAA
ATAGCAGACA ATTTGGCTGA ATATTTGAGC GATGATGATT TGATTATACC TTTTACAAGC
TTAGGCTCAT TTTATCAAGG TCAAGGATTG TACCCACTGG CACAACCTTG GTTAGAGAAA
GGTAAAGAAA TAGCTGAAAA ACGTTTAGAT AAAAATAATT CTGATATTGC AGCTATTTAC
AACAACCTGG CATCATTATA TCGTGCACAA GGAAAATACG AAGCAGCTGA ACAATTGTAC
CTACAAGCAA TAGAAATCCA CAAAATTGCC CTCCCTGAAA ATCATCCAGG TATTGCCACA
CACCTCAACA ACCTGGCAAA TTTATATCGT GTACAAGGAA AATACGAAGC AGCAGAACCT
TTGTTCCTAC AAGTAATAGA AATCCACAAA ATCGCCCTCC CTGAAAATCA TCCAAATATA
GCCAGCGGCC TCAACAACCT GGCAGCATTA TATAAGTTAC AAGGAAAATA CGAAGCTGCA
GAACCTTTGT TCCTACAAGC AATAGAAATC GACAAAATCG CCCTCCCTGA AAATCATCCA
TCTCTTGCCA CAGACCTCAA CAACCTGGCA TTATTATATC ATTCACAAGG AAAATACGAA
GCTGCAGAAC CTTTGTTCCT ACAAGCAATA GAAATCGACA AAATCGCCCT CCCTGAAAAT
CATCCAAATA TAGCCAGCGG CCTCAACAAC CTGGCAGCAT TATATAAGTT ACAAGGAAAA
TACGAAGCTG CAGAACCTTT GTACCTACAA GCAATAGAAA TCGACAAAAT CGCCCTCCCT
GAAAATCATC CACAACGTGC CACACACCTC AACAACCTGG CAAATTTATA TCGTGCACAA
GGAAAATACG AAGCAGCAGA ACCTTTGTAC CTACAAGCAA TAGAAATCCA CAAAATCGCC
CTCCCTGAAA ATCATCCAGG TATTGCCACA CACCTCAACA ACCTGGCAAA TTTATATCGT
GTACAAGGAA AATACGAAGC AGCAGAACCT TTGTTCCTAC AAGTAATAGA AATCCACAAA
ATCGCCCTCC CTGAAAATCA TCCAAATATA GCCAGCGGCC TCAACAACCT GGCAGCATTA
TATAAGTTAC AAGGAAAATA CGAAGCTGCA GAACCTTTGT TCCTACAAGC AATAGAAATC
GACAAAATCG CCCTCCCTGA AAATCATCCA TCTCTTGCAA GAGACCTCAA CAACCTGGCA
GAATTATATC GTGAACAAGG AAAATACGAA GCTGCAGAAC CTTTGTTCCT ACAAGCAATA
GAAATCGACA AAATCGCCCT CCCTGAAAAT CATCCATCTC TTGCCACAGA CCTCAACAAC
CTGGCATTAT TATATCATTC ACAAGGAAAA TACGAAGCTG CAGAACCTTT GTTTCTACAA
GCAATAGAAA TCGACAAAAT CGCCCTCCCA GAAAATCATC CACAATTAGC CACACACCTC
AACAACCTGG CAGGATTATA TCATGCACAA GGAAAATACG AAGCTGCAGA ACAATTGTAT
CTACAAACAA TAGAAATCGA CAAAATCGCC CTCCCTGAAA ATCATCCATC TCTTGCAAGA
GACCTCAACA ACCTGGCAGA ATTATATCGT GAACAAGGAA AATACGAAGC AGCTGAACCT
TTGTACCTAC AAGCTATTGA AATATTTACA CAATCATTAG GTGAAGAACA TCCCAACACT
CAAACAGTTC TGAAAAACTA TCAAATATTT TTAAATGAGA AAAATGAATC AAAACAAAAT
CAAGATAAAT ATTAG
 
Protein sequence
MSDNFEDKVD IKETHGVGVN SGNIVNSTFA KTIINDTKNV SWKGEPAEFP NNLNILRTGA 
VKFVGRDKDI ENLHEQLQEK ERVSITAVVT GMAGVGKTEL ALQYSLLSEK ELNYPGGICW
INVRERSVGE QLLSFAQTQL GLFPAEDWSL EERISFCWSN WQPPGDVLIV LDDVNKYEEI
EQYLPPQKQR FKLLITTRKY WLSESFSQLR LEVLDEDSAL ELLEVLIGNS RLGTQIEEAK
QLCEWLGYLP LGLELVGRFL KRRSEWTLER MIQELEKQAL NFSVLQNPPQ GEMTAQRGVA
AAFELSWNEL DERGRYLGCL LSLFALAPIP WNLVEKCLSK DESQEKGIIQ RWFPTFSRLW
LLLMPPKKVD VLDSRTWEDI REDTLLDLNL IQKTTQGTYE LHQLVRRYFQ DKLNAMKEVE
QLKSQFCRVI VGAAEKIPYN NDITVEQVKE VEIDIPHITE IADNLAEYLS DDDLIIPFTS
LGSFYQGQGL YPLAQPWLEK GKEIAEKRLD KNNSDIAAIY NNLASLYRAQ GKYEAAEQLY
LQAIEIHKIA LPENHPGIAT HLNNLANLYR VQGKYEAAEP LFLQVIEIHK IALPENHPNI
ASGLNNLAAL YKLQGKYEAA EPLFLQAIEI DKIALPENHP SLATDLNNLA LLYHSQGKYE
AAEPLFLQAI EIDKIALPEN HPNIASGLNN LAALYKLQGK YEAAEPLYLQ AIEIDKIALP
ENHPQRATHL NNLANLYRAQ GKYEAAEPLY LQAIEIHKIA LPENHPGIAT HLNNLANLYR
VQGKYEAAEP LFLQVIEIHK IALPENHPNI ASGLNNLAAL YKLQGKYEAA EPLFLQAIEI
DKIALPENHP SLARDLNNLA ELYREQGKYE AAEPLFLQAI EIDKIALPEN HPSLATDLNN
LALLYHSQGK YEAAEPLFLQ AIEIDKIALP ENHPQLATHL NNLAGLYHAQ GKYEAAEQLY
LQTIEIDKIA LPENHPSLAR DLNNLAELYR EQGKYEAAEP LYLQAIEIFT QSLGEEHPNT
QTVLKNYQIF LNEKNESKQN QDKY