Gene Bind_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1603 
Symbol 
ID6198873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1808853 
End bp1810676 
Gene Length1824 bp 
Protein Length607 aa 
Translation table11 
GC content57% 
IMG OID641705594 
Productfilamentous haemagglutinin outer membrane protein 
Protein accessionYP_001832724 
Protein GI182678578 
COG category 
COG ID 
TIGRFAM ID[TIGR01901] filamentous haemagglutinin family N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.17462 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.795272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCGCT CGGAATCCGT CCTGAAAACC ACCTTGCAAA ATCCGGTGAT GCCCATTTCG 
AGCTTTGGCT CCGAGAGGAT GCGCACAGTA TTCCGTCCCC TCATTGCTAT GACGGCTCTG
GCGGCGCTTA TGGTCCTGCC ACGCCCGGTT GCAGCGGGGC CTGACGGTGG CTCGGTCGTC
GCGGGACAAG CGGCCATCTC GCAGGCGGGC AGCGTCACGA CCATTAATCA GTCCACGCCC
AAAGCCATCA TCAATTGGCA AGGCTTTTCC ATCAACGCGC ATGAGACGGT GAATTTCAAC
CAGCCTTCGA GCAGCGCCGC CACTTTGAAC CGCGTCATCG GCAACGAGGC GAGTGTCATC
GCCGGCGCGC TCAACGCCAA TGGCCAGGTT TTTCTCGTCA ATTCGGCGGG TGTTCTGTTC
ACCCACGGGG CGCAGGTCAA TGTTGGTGGT CTCGTCGCCT CGACCCTCGA TATTGCCAAT
ACAGACTTCA TGGCCGGAAA ATATACCTTC TCCGGCACCT CCTCGGCCTC GGTCATCAAC
CGCGGCCACA TCCTTGCACA TGAGGGCGGC TATGTCGCGC TTCTCGGCAA GACGGTCTCG
AACGAGGGGG TGATCACGGC CACGCTCGGT ACGGTAGCCA TGGCCTCGGG CGAAAAGATC
ACGCTGAACT TCGATGGCAA TTCGTTGATC GATGTGACGA TCGACAAGGG GACCTTGAAC
GCGCTCGTCC AGAACAAAAG GGCGATCCAG GCAGATGGTG GCCGGGTGAT CCTGACCGCC
AAGGCGGCGG ATGCCGTGCT TTCGGCACAG GTGAACAATA GCGGCATCAT CCAAGCGCGC
ACCATGGCCG ATCTCAAGGG TGGCCAAGCC ACATCGGGAT CCACAGGCGG CTCGGTCCAT
GTGGGCACGA TCAAACTTCT GGCGCAGGGC GGGACCACGA AGGTCGCGGG CAAGCTCGAC
GTTTCGGCGC CGAACCAGGG TAACGGCGGA TCGATCGAGA CCGGCGCCAA TAAAGTGCAG
GTGGCCTCAA AAGCAACGAT CATTACGAAA GCGGCGAGCG GACAGGATGG CACTTGGCTC
ATCAGTCCCA AGGATTTCAC GATCATGTTG AGTGGGGACG TCACCGGAAC ACAGCTGGCC
AGCAGCAATA TAACAATCCG GCCGGCGAGT GGAGGAGGCA TTCATTTCGG TGTCAACACG
ACGCTTGCTG ATCTAGGGCA TGAACCGGAT AACATGTACA TCAGCTTTTT TGGTGATTCT
GGTTTACACA CCGCAATTGG TTCAACCAGG GGCATTATTC GCGATATCAG GCTGGAGAGT
GTTGATTTCT GGAGCAGCAC CATTCCCACC GCGCCTATTA CCGGTTCCTA TCCCACTGGA
ACTGTGACAG TCCCCGGTGG TGTCGGCGGC TTGGTAGGGT CTATAGGCAG CACTACTATT
ACTGGCTCCT ATCCCACTGG AATTATGACA GGCTCTGGTA GTGTCGGCAA TCTGATGTGG
TATAGTGGCA CCGCCAGAGA CATTATCTAC TCTTATCCCA CTGGAAATTT GACAAACTCC
GGTAGTGTCG GCGGTCTGGT GGGGTACACT GACGCCCCTA TTAGCAATGT ATATCTCACT
GGAGGTGTGA TAAGCTCCGG TAGTAGTATC TATGGTCTGG CAGGGTACAC GAGCGGCATT
ATTACCAACT CCTATGCCAC TGTAAATTTG ACAAACTCCG GTAGTGTCAG CGGTCTGGTA
GGATACAACA CTGGCGCCCC TATTGGCAGC GTTTATCTCC CTGGAATTGT GCCAGCCCCT
GTTAGCAATG TAAGCGGTGG CTAA
 
Protein sequence
MPRSESVLKT TLQNPVMPIS SFGSERMRTV FRPLIAMTAL AALMVLPRPV AAGPDGGSVV 
AGQAAISQAG SVTTINQSTP KAIINWQGFS INAHETVNFN QPSSSAATLN RVIGNEASVI
AGALNANGQV FLVNSAGVLF THGAQVNVGG LVASTLDIAN TDFMAGKYTF SGTSSASVIN
RGHILAHEGG YVALLGKTVS NEGVITATLG TVAMASGEKI TLNFDGNSLI DVTIDKGTLN
ALVQNKRAIQ ADGGRVILTA KAADAVLSAQ VNNSGIIQAR TMADLKGGQA TSGSTGGSVH
VGTIKLLAQG GTTKVAGKLD VSAPNQGNGG SIETGANKVQ VASKATIITK AASGQDGTWL
ISPKDFTIML SGDVTGTQLA SSNITIRPAS GGGIHFGVNT TLADLGHEPD NMYISFFGDS
GLHTAIGSTR GIIRDIRLES VDFWSSTIPT APITGSYPTG TVTVPGGVGG LVGSIGSTTI
TGSYPTGIMT GSGSVGNLMW YSGTARDIIY SYPTGNLTNS GSVGGLVGYT DAPISNVYLT
GGVISSGSSI YGLAGYTSGI ITNSYATVNL TNSGSVSGLV GYNTGAPIGS VYLPGIVPAP
VSNVSGG