Gene Bind_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2020 
Symbol 
ID6201344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2307626 
End bp2309266 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content51% 
IMG OID641706007 
Productlevanase 
Protein accessionYP_001833131 
Protein GI182678985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1621] Beta-fructosidases (levanase/invertase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0275003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.946482 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTATA AATATCCTCA TCTAACGACT GGATTAGGGG GCTTACTGGC CCGCTTATTC 
GCCACATTCG TGATTTCATC TGCGATAGAG CTTGGTTCGT TCGGTCTGTT GCATGCCGAG
GAGACAGGCA CGGTCCAGTG GCGTCCGGCC TTGCATTACA CGCCCGAACG GAACTGGATG
AACGATCCAA ACGGATTGGT ATTCAATAAC GGCCTTTATC ATTTGTTCTA CCAATATAAC
CCCAAAGGAA ATGTCTGGGG TAATATGTCA TGGGGACATG CAACCAGCCC TGATCTGATT
CATTGGAATG AACATGATGT GGCGATGTCG GCAAACGAGA CCGAAGAGAT TTTCTCCGGC
TCAATCGTCG TCGACGAACA CAATACATCC AGATTAGGCT CGGCGAATTC ATCCCCTCTC
ATCGCACTTT ATACGAGTGC GTATAAGGCT GGGTCGGGTC ACCCCGCAGG AACTCAGGCA
CAATCGCTTG CCTACAGCCA AGACGAAGCG CAAACGTGGC ATCCGTATGA TCATAATCCC
GTATTGACTC TTAGCCCGGA GTCAAAAAAC TTTCGAGATC CAAAAATTTC CTGGTATCCG
AAAGGAGGTT ATTGGCTGCT GACAACTGTT GTCGCGGATG CGCAAGTCGT TAAAATCTAT
CGCTCCAATA ATCTGCTCAA CTGGGAATTC CTCAGTGATT TCAGTCTCCC TGGTATCCCT
CATCAGGGTG CGCTTTGGGA AATGTCCGAT CTTTTCCCTC TTCCTCTTGA CGGCGATAAA
AACGATCAAA AGTGGGTTAT GATTGTCAAT GTCAACCCTT GGTCAATCGC GGGAGGATCC
GGCGCGCTTT ATTTCGTCGG AGGTTTCGAT GGCAAGGTGT TTGTTCCTGA GCATCTTCCT
CCGGCAGGCT CGGACCCTTC CCAATATTTG TGGCTCGACC ACGGCGCCGA CTTTTATGCG
GCTGGAACAT TTGCCCATGA GCCCCATGGC AAAGCGGTGA TCATGGGCTG GATGAGCAAT
TGGGATTATG CGGAGCATGT CCCGACGGCA CCATGGAAAG GGGCAATGGC CCTGCCGCGT
GTGCTCGCGT TGAAAACAAT CGATGGTATC CCGCAACTCG TCTTTTCTCC CGTCGATCAA
TATACATCCC TAGTCCAGGG ACAGCCGGCG GCGAGAATTG AGACTCTGAC CGTCTCCTCG
TCAATCAAGG AACTTGACCC GTCCACGCAA GGAACCGTGC AGAATATCGC GGTTACCATC
CATCCCGGCG CCGCTCAACG TGCTGGGCTC ATCATACGCG GTTCAGCAAA GGGTGATGTG
GGGACGCGGA TTTTTTATGA CACATCCAAC CACACATTGA CACTCGATCG TTCCCAATCT
GGCGAAACGA ACTTTTCAAG TGCATTCAGT AAACAACATA TTGTCAACTT GCCGCTAGAG
AATGGGGAAC TGCGTCTCAC AATCATTGTG GATAGGAATT CGGTCGAGGT TTTCGCCAAC
AATGGCCGCG CAGTCATCAC GGATCTCATT TTTCCGACTC TTGATGACAA TCGCATCTCT
GTCTTCGCGG AGCATGGCGA TGCGACATTC AATGACCTCG CCATTACCAA TCTCTCCGAT
CTGACTAATA TAAAGCAGTA A
 
Protein sequence
MLYKYPHLTT GLGGLLARLF ATFVISSAIE LGSFGLLHAE ETGTVQWRPA LHYTPERNWM 
NDPNGLVFNN GLYHLFYQYN PKGNVWGNMS WGHATSPDLI HWNEHDVAMS ANETEEIFSG
SIVVDEHNTS RLGSANSSPL IALYTSAYKA GSGHPAGTQA QSLAYSQDEA QTWHPYDHNP
VLTLSPESKN FRDPKISWYP KGGYWLLTTV VADAQVVKIY RSNNLLNWEF LSDFSLPGIP
HQGALWEMSD LFPLPLDGDK NDQKWVMIVN VNPWSIAGGS GALYFVGGFD GKVFVPEHLP
PAGSDPSQYL WLDHGADFYA AGTFAHEPHG KAVIMGWMSN WDYAEHVPTA PWKGAMALPR
VLALKTIDGI PQLVFSPVDQ YTSLVQGQPA ARIETLTVSS SIKELDPSTQ GTVQNIAVTI
HPGAAQRAGL IIRGSAKGDV GTRIFYDTSN HTLTLDRSQS GETNFSSAFS KQHIVNLPLE
NGELRLTIIV DRNSVEVFAN NGRAVITDLI FPTLDDNRIS VFAEHGDATF NDLAITNLSD
LTNIKQ