Gene Bind_2221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2221 
Symbol 
ID6199536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2547114 
End bp2548676 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content60% 
IMG OID641706210 
Producthypothetical protein 
Protein accessionYP_001833328 
Protein GI182679182 
COG category[R] General function prediction only
[S] Function unknown 
COG ID[COG0645] Predicted kinase
[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCG TATCTCCAGG CGAACTGGCC CATGACACCC ACGATGACGG ACAGGCGGAG 
ACGATAGCGT TCCTGCAATC GGAGAAAGCG TTTGGCGAAA AGCCGCTCAG GCCCCCCATA
GCGACGCATA TTTCGCTGAT TTTTCTGTAC GCACAGCGCG CCATCAAGCT CAAACGCGCA
GTCCATTTTC CCTATGTCGA TTTTTCCACA CCGGCCCTGC GGTTGGCCGC CTGCGAGCGT
GAATTGGCGC TCAATCGCCG CACAGCGCCC ACTCTCTATT CGGGTGTACG GCGCATTACA
CGGGACAATG ACGGATCTTT ACAGATCGAC GGAAAGGGAC CTCTGGTCGA TGCCGTCGTG
GATATGCGGC GGTTCGACGA TGACGCCCTG CTCGCCCATC ACGCCGAGCA GGGCGGCCTC
CCCATCCCGC TCCTGACCAA GCTCGCTCAG ACCATTGCGA CCTTTCACCG CGCCGCCGAA
ACTGTCGCGA ATGGAAGGGA CGATGAAACC GCCACAGCGC GGCTGGCCAG GATAATCGAC
CTCAACGAAG CGGCTTTCGC GAGCAATAGC ATCATTCCCG CACAAGCGTC CTTGGCACTC
GCTCAAACTT TTCGCGCCAG GCTGGCAGCG CTTGCAAGTC TGCTCGATCA TCGCGCCAAG
GCGGGCAAGA TCCGCCATTG TCACGGCGAT CTGCACCTGC GCAATATTTG TCTCCTCGAG
GGAGAACCCA CGTTGTTTGA TTGCCTCGAG TTCGATGATG ATATGGCGCG TGTCGATATT
CTCTATGATC TCGCCTTTCT TCTGATGGAT CTCTGGCATC GCGGCCTAGC GCGGGAAGCC
AATTGGATTT TCAACCGCTA TCTAGATCAG ATGGACGAAG ACGACGGCTT GACCGCCATG
CCCTTTTTCA TGGCGCTGCG CGCGGCCATT CGCGCCCATG TCGCGGCGAC GCTCGGCCGA
TCCAACGAAG CCCTCTCTTA TTTCGCCCTC GCCCAGGCGC TTCTGCATCC ACGACCCGCT
GCACTGGTCA GCATTGGTGG TCTTTCTGGA ACCGGCAAAT CGACACTTGC CGCCGCCTTG
GCACCCGCGA TCGGGCCGGC GCCTGGCGCC CGCGTTCTTT CGAGCGACCG TATCCGCAAA
GGATTATTCG GTGTTCGCGC TGAGACGCGC CTGCCCCCCG AGGCCTATGC GCCGGAAGTC
TCAGCCCGCG TCTACGCGCG AATCACGACC CTGGCCGAGA CGATTTTGCA TCTCGGCCAG
GGTGTCGTCG CCGATGCCGT TTTTGACCGG ATGGATGACC GCGCCGAAAT CGAACGCGTT
GCCGCGAAGG GGAATGTCCC CTTTCTCGGC TTCTGGCTTG AAACGAGCCT CGAACGCCAG
ATCGAACGTG TCGAGGCACG GCGCAATGAT GCATCCGACG CTGACGCGAC AATCGTGCTT
GCCCAAAGAG ATCGCGACAC AGGCGCCATC CAATGGCATC ATCTTGTTTC CGACCATGAG
GCCACGACAA CAGCCCGGCA AGCCTTGGAG ATCTGCCAAG CCCGTCTCGA ATGCCCGGCT
TGA
 
Protein sequence
MPPVSPGELA HDTHDDGQAE TIAFLQSEKA FGEKPLRPPI ATHISLIFLY AQRAIKLKRA 
VHFPYVDFST PALRLAACER ELALNRRTAP TLYSGVRRIT RDNDGSLQID GKGPLVDAVV
DMRRFDDDAL LAHHAEQGGL PIPLLTKLAQ TIATFHRAAE TVANGRDDET ATARLARIID
LNEAAFASNS IIPAQASLAL AQTFRARLAA LASLLDHRAK AGKIRHCHGD LHLRNICLLE
GEPTLFDCLE FDDDMARVDI LYDLAFLLMD LWHRGLAREA NWIFNRYLDQ MDEDDGLTAM
PFFMALRAAI RAHVAATLGR SNEALSYFAL AQALLHPRPA ALVSIGGLSG TGKSTLAAAL
APAIGPAPGA RVLSSDRIRK GLFGVRAETR LPPEAYAPEV SARVYARITT LAETILHLGQ
GVVADAVFDR MDDRAEIERV AAKGNVPFLG FWLETSLERQ IERVEARRND ASDADATIVL
AQRDRDTGAI QWHHLVSDHE ATTTARQALE ICQARLECPA