Gene Bind_2021 details


Gene Information

Locus tag: Bind_2021
Symbol:
ID: 6201278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2309632
End bp: 2311236
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 57%
IMG OID: 641706008
Product: levansucrase
Protein accession: YP_001833132
Protein GI: 182678986
COG category:
COG ID:
TIGRFAM ID:
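
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only. It assumes the Start bp and End bp values are 1-based, inclusive coordinates, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2309632
end_bp = 2311236

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not contribute
# a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1605 0 534
```

Under those assumptions the result agrees with the listed Gene Length (1605 bp) and Protein Length (534 aa).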


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00566302
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTC GATCGTTTAA TGTTTGTATA CGTAGCCTCA TCGCGGGCTC GCTTCTGACT 
GCCACAGCAC TGTCCGCTCA GGCTCAATCG GGTTACCCGA TACCGACTCC GCATTCGGGA
CAAGCCTATG ATCCATTTGC GGATTTTACC GCCAAATGGA CGCGCGCCAA TGCCCGTCAA
ATCAAGGCGC AATCACATGT CCCGGTGTCA CCCGATCAGA ATTCGCTGCC GCTCAATCTG
ACGATGCCCG ATATCCCTGC CGATTTCCCG CAAACCAACC CGGACGTGTG GGTGTGGGAT
ACGTGGCCTC TCGCCGATGT GCATGGCAAT CAGCTGAGCT TCCAGGGGTG GGAGGTCATT
TTCTCGCTGA CCGCTGATCC GCATGCCGGT TATGTTTTCG ATGATCGCCA CGTTCACGCA
CGTATCGGCT TCTTTTATCG CAAGGCCGGA ATTCCCGCGA ACCAGCGCCC GATTGATGGC
GGCTGGATCT ATGGCGGGCA TTTGTTCCCG GATGGTAGCA GCGTCAAAGT CTTCGGTAAC
GTCCCCATGA CGCAAAACGC GGAATGGTCC GGCGGCGCCC GCTTCGTGGG CGGCCCTTAT
GCTGATGGCC CGCAACACGC CTACCTGAAG AACAACAACG TCAGCCTCTA TTACACGGCG
ACATCGTTCA ACCGTAATGC TCAGGGCGGT AACATCACAC CGCCGATCGC CATCATCTCG
CGCGCGGATG GACAAATTCA AGCAGATGAT AAGCATGTGT GGTTCACGGG ATTCGATCAA
CATCTCCCGC TGCTCGCACC CGACGGCAAA TATTATCAGA CCGGTCAGCA GAACGAGTTC
TTCTCCTTCC GCGATCCCTA TGTCTTCCTT GACCCCGCTC ATCCGGGCAA GACCTTCATG
GTCTTCGAAG GCAATACCGC CGTGCAGCGC GGCTCCCGCT CCTGCACCGA GGCAGATCTC
GGATATTCTC CCAATGACCC GAACAAAGAA GACCTGAATG CGGTCATGGA CTCCGGAGCC
ATTTACCAAA TGGCCAATGT CGGTCTTGCC GTGGCGACGA ACGATGAACT GACGCAGTGG
AAGTTCCTGC CGCCGATCCT GTCCGGTAAT TGCGTGAACG ATCAGACCGA ACGTCCTCAG
ATCTATCTGA AGGATGGAAA ATATTACCTG TTCACGATCA GCCACCGCAC GACCTATGCG
GCGGGCGTCG ATGGGCCGGA CGGCGTCTAT GGCTTCGTCG GTGATGGCAT TCGCAGCGAC
TTCATTCCCC TGAATGGCCT CAGCGGCCTC ACGCTCGGCA ACCCGACCGA TCTCTATCAG
CCGGCCGGCG CTCCTTACGC CTTGAATCCA AATCAAAATC CTCGGACGTT CCAGTCCTAT
TCGCATTATG TCATGCCGGG CGGCCTCGTT GAATCGTTTA TCGATGCCAT CGGCACTCGT
CGCGGTGGCG CGCTGGCTCC GACGGTGAAG ATCAACATCA ACAGAACTTC TACCATCCTC
GACAGGACCT ATGGCAATGC CGGATTGGGT GGCTATGGCG ACATCCCGGC CAATCTTCCC
GCGCTTGGCC AAGGTAATGG CCATGGTGTC ACGAACGGCC AGTAA
 
Protein sequence
MASRSFNVCI RSLIAGSLLT ATALSAQAQS GYPIPTPHSG QAYDPFADFT AKWTRANARQ 
IKAQSHVPVS PDQNSLPLNL TMPDIPADFP QTNPDVWVWD TWPLADVHGN QLSFQGWEVI
FSLTADPHAG YVFDDRHVHA RIGFFYRKAG IPANQRPIDG GWIYGGHLFP DGSSVKVFGN
VPMTQNAEWS GGARFVGGPY ADGPQHAYLK NNNVSLYYTA TSFNRNAQGG NITPPIAIIS
RADGQIQADD KHVWFTGFDQ HLPLLAPDGK YYQTGQQNEF FSFRDPYVFL DPAHPGKTFM
VFEGNTAVQR GSRSCTEADL GYSPNDPNKE DLNAVMDSGA IYQMANVGLA VATNDELTQW
KFLPPILSGN CVNDQTERPQ IYLKDGKYYL FTISHRTTYA AGVDGPDGVY GFVGDGIRSD
FIPLNGLSGL TLGNPTDLYQ PAGAPYALNP NQNPRTFQSY SHYVMPGGLV ESFIDAIGTR
RGGALAPTVK ININRTSTIL DRTYGNAGLG GYGDIPANLP ALGQGNGHGV TNGQ
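
The gene and protein sequences above are related through translation table 11, which assigns amino acids to codons in the same way as the standard genetic code and differs from it only in the permitted start codons. As a minimal sketch, not a replacement for an established tool, the snippet below builds that codon table and translates the first and last stretches of the gene sequence. The names `translate`, `first_block` and `last_block` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in T, C, A, G order with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Characters other than A, C, G, T raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
first_block = "ATGGCAAGTC GATCGTTTAA TGTTTGTATA"
last_block = "ACGAACGGCC AGTAA"

print(translate(first_block))  # MASRSFNVCI
print(translate(last_block))   # TNGQ
```

The two outputs match the first ten and last four residues of the protein sequence listed above. The last block is in frame because it starts at base 1591 of the 1605 bp gene, a position that follows a whole number of codons.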