Gene Bind_1009 details


Gene Information

Locus tag: Bind_1009
Symbol:
ID: 6200437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 1157789
End bp: 1159195
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 60%
IMG OID: 641705000
Product: peptidase M16 domain-containing protein
Protein accession: YP_001832141
Protein GI: 182677995
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
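
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1157789, 1159195)
print(length)                  # 1407
print(protein_length(length))  # 468
```

Both results match the Gene Length (1407 bp) and Protein Length (468 aa) fields of this record.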


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGA TGGTCTGCAT GTCTCAGGGT CAGTCGAGCG CGGCATCACC TGTTCAAGGT 
TCTTTCGGCC TCACCCCTTC GCCAGCCAAT CCGGCCAAGG CCCATGGGCC GCAAGTCAGT
CAGGCGAAGC TCGCCAATGG CATGGACATT GTGGTCATTC CTGACCATCG CGCGCCTGTC
ATCACCCATA TGGTCTGGTA TCGTAACGGG TCCGCCGATG ATCCAGTGGG CAAGTCCGGC
ATTGCGCATT TCCTTGAACA TCTCATGTTC AAGGGCACCA AGGATCACAA ACAGGGCGAA
TTTTCCGAAG TGATCGCCGA TTTCGGCGGT CAGGAAAACG CCTTCACCTC GAATGATTAT
ACCGCCTATT TCCAGCGGGT CGCCAAGGAC CATCTCCGCG TCTGCATGAA TTACGAGGCT
GACCGGATGA AAAATCTGGT CCTCTCCGAT GAAGTGGTCG CCCCCGAGCG CGATGTCGTG
CTCGAGGAGC GCCGCATGCG CACGGATTCC GATCCCTCGG ACCTTCTGAA CGAGGCAGTC
CAGGCCGCCC TTTATACGCA TCATCCCTAT GGCAAGCCGA TCATCGGTTG GAGCCATGAG
ATCGAAACCC TCGATCGCCA GGATGCGTTT GCCTATTACG ATCGTTTCTA TACGCCAGAA
AATGCGATTC TCGTCGTCGC CGGCGATGTC GAGCCCGATG AGGTTCTGGC GCTTGCCGAG
GATGTCTATG GCAAGATCCC GGCCCATGGC GAGGCGCCGC GTCGCTCGCG TCCCCGTGAG
CCCGAGCCGC GCGCTCATCG GCTCGTCAAG CTCGTCGATG AAAAGGTCGA ACAGCCGACG
CATCAGGGGG TCTTCCTCGT CCCGTCCTAC AAGACGGCCG CGCCTGGCGA AGCGGAAGCG
CTTGAAGTTC TCGGCCATTT GCTGGGCGGC GGTCAGACCA GCCTGTTGTT CAAAAAGCTC
GTCGTGGCCG ATAAAGTCGC CGTTGCCGCC GGCGCCCATT ACCAGGGGAC GGCTGTCGAT
CAGACGCGCT TCTATGTTTA TGGCATCCCG GCGCCAGGCA TTACGCTCGA GGAAATCGAC
AATGCCATTG ACGCCGTCAT TGCCCATGTG GCCAAGGAAG GCGTCTCGGA AGCGGATCTG
CGTCGCACCA AGACCCGACT CGTCGCAGAG GCGATCTATG CCCAGGATAA TCAATCGACA
TTGGCGCGGT GGTATGGCGC TTCGCTCAGC GTTGGCCTGA CCCTGAACGA TATTGCCGAA
TGGCCGGCGC GAATCGAGGC CGTTACCCTG GAGGATGTCA AGAAAGCCAC GCGCTGGCTC
GCCAAAAGGC GCGGCGTTAC GGGTTTCCTC CTGCCGGCCC ATGCCCCAGG AGAACATACG
ATCGAGGTCG AGACCGAGGC CAGTTGA
 
Protein sequence
MKAMVCMSQG QSSAASPVQG SFGLTPSPAN PAKAHGPQVS QAKLANGMDI VVIPDHRAPV 
ITHMVWYRNG SADDPVGKSG IAHFLEHLMF KGTKDHKQGE FSEVIADFGG QENAFTSNDY
TAYFQRVAKD HLRVCMNYEA DRMKNLVLSD EVVAPERDVV LEERRMRTDS DPSDLLNEAV
QAALYTHHPY GKPIIGWSHE IETLDRQDAF AYYDRFYTPE NAILVVAGDV EPDEVLALAE
DVYGKIPAHG EAPRRSRPRE PEPRAHRLVK LVDEKVEQPT HQGVFLVPSY KTAAPGEAEA
LEVLGHLLGG GQTSLLFKKL VVADKVAVAA GAHYQGTAVD QTRFYVYGIP APGITLEEID
NAIDAVIAHV AKEGVSEADL RRTKTRLVAE AIYAQDNQST LARWYGASLS VGLTLNDIAE
WPARIEAVTL EDVKKATRWL AKRRGVTGFL LPAHAPGEHT IEVETEAS
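
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the space-separated 10-letter blocks are pasted in as plain strings and uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle alternative start codons such as GTG or TTG, which is harmless here because this gene begins with ATG. The names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
# Codon assignments for NCBI table 11, listed in T, C, A, G order
# for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGAAAGCGA TGGTCTGCAT GTCTCAGGGT"
print(translate(first_blocks))    # MKAMVCMSQG
print(gc_fraction(first_blocks))  # 0.5
```

The translated residues match the first block of the protein sequence above. Passing the complete gene sequence to the same functions would allow the full protein sequence and the GC content field to be checked in the same way.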