Gene Clim_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2414 
Symbol 
ID6355885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2644223 
End bp2645344 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content54% 
IMG OID642670004 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001944414 
Protein GI189347885 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACAAA AAACACTCTT TGCCGCTCTC TTCATCTTCA TGCAGCTCTC TTCGCTTCCA 
GCGCAGGCCG CTGCTCCAGT TGACAGTCTT TCCATAAAAA TCGGCCAGAT GCTTATGATC
GGGTTCCGGG GGCTGACAGC GAAAGCGCCC GGAATAGCTG ACGATATCCG CAAGCGTCAT
ATCGGCGGCG TGGTGCTATT CGATTATGAT GTACCATTGA AATCACCGGT ACGGAATATC
GCTGGCCCCG AGCAGCTGTC GAAACTAACG CGTGAGCTGA TGGATCTTTC GGAAATCCCG
CTGTTCATCG CGCTTGACCA GGAAGGCGGA AAGGTGAACC GTCTGAAAAC CTCAAAGGGA
TTTCCCCCCT CGGTTTCAGC TGCACACCTC GGCATGCTCG ATAACCCGGA CAGCACAACC
GCCGCAGCGC GACAGACCGC CGCGACGCTG AAAAAAATGC ACCTGAACAT GAACCTTGCG
CCGGTGCTCG ACCTGAACAC CAATTCTGAG AATCCGGTCA TCGGCAAACT TGGTCGCAGC
TACTCCGCTG ATCCTGCAGT CGTCACGCGT CATGCCGGGC TGACGGCGAG AGTTTTTCGT
GAAGAGGGAA TCATTCCGGT CTTCAAACAC TTTCCGGGGC ACGGCAGCTC AACAACGGAC
TCCCACAAGG GCTTCACGGA CGTTACCGCA AGCTGGACGA AAAAAGAGAT TGAACCGTAC
CGTTCGTTGA TCGCGGCCGG CTACGACGAT GCCGTCATGA CAGCTCATGT GTTCAACAGG
CAGCTTGACG ACCGCTATCC GGCCACACTT TCGCAGAAGG TACTGAACGA CCGTCTGCGC
AGCAGACTCC GCTTCGACGG AGTTATCCTG AGCGATGATA TGCAGATGAA AGCCATTGCC
GACCAGTTCG GACTTGAAGA TGCCATCAGA CTGGCTCTCG ATGCAGGAGT GGATATCCTG
ATCTTTGGCA ACAACACCAC ATTCGATCCC GCAATTGCTG AAAAAGCCAC AGCAATCCTC
CATGAGCTTG TACAAAACGG TACGGTAAGC CGAGCCCGTA TTGACCGCTC CTACCGGAGA
ATCATGGCTC TCAAGGAACG CTACCTCTAC CACTGCAAAT AA
 
Protein sequence
MIQKTLFAAL FIFMQLSSLP AQAAAPVDSL SIKIGQMLMI GFRGLTAKAP GIADDIRKRH 
IGGVVLFDYD VPLKSPVRNI AGPEQLSKLT RELMDLSEIP LFIALDQEGG KVNRLKTSKG
FPPSVSAAHL GMLDNPDSTT AAARQTAATL KKMHLNMNLA PVLDLNTNSE NPVIGKLGRS
YSADPAVVTR HAGLTARVFR EEGIIPVFKH FPGHGSSTTD SHKGFTDVTA SWTKKEIEPY
RSLIAAGYDD AVMTAHVFNR QLDDRYPATL SQKVLNDRLR SRLRFDGVIL SDDMQMKAIA
DQFGLEDAIR LALDAGVDIL IFGNNTTFDP AIAEKATAIL HELVQNGTVS RARIDRSYRR
IMALKERYLY HCK