Gene Paes_2172 details


Gene Information

Locus tag: Paes_2172
Symbol:
ID: 6458675
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prosthecochloris aestuarii DSM 271
Kingdom: Bacteria
Replicon accession: NC_011059
Strand:
Start bp: 2345733
End bp: 2346860
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 54%
IMG OID: 642726148
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_002016821
Protein GI: 194334961
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
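
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2345733
end_bp = 2346860
gene_length_bp = 1128
protein_length_aa = 375


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino-acid count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the record: 2346860 - 2345733 + 1 = 1128 bp, and 1128 / 3 - 1 = 375 aa.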


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA AAGCACTCCT CTCACTCCTC GTTCTGCTTA TCACAAACAT ACTCTCCGCC 
TGCGCTTCCA ATCAGCAGGC AGAGGATGGT ATCGATGCAA AAATCGGCAG GATGATCATG
GTCGGATTCC GCGGAATGTC GATTGAAGAG GCTCCGTGGA TCAGCGACGA TATCGCCAGC
AAGAGAATCG GGGGAGTGAT CCTCTTCGAT TACGACGTTC CTTCGGCATC GACGACGAGA
AACATCGCAT CGCCCGGCCA GCTTGCCGCA TTGACCCGTC AACTCCAGGA GTGTTCCCCC
GAACCGCTTC TGATCGCCAT CGATCAGGAA GGCGGCAGAG TTTCACGACT GAAACCCTCC
CGGGGATTTC CCGAAAGCGT TTCAGCGGCG CATCTTGGCG CCGTCAACGA TCCCGACAGC
ACCTTGCGGA GCGCAGCAAC AACTGCGGCA ACGCTGCAAT CGATGCACAT CAACCTGAAC
TTCGCCCCGG TAGCTGATGT CAATATCAAT CCGGACAACC CCGTCATAGG CCGTCTGGAA
CGAAGCTTCT CGTCCGACCC TGCAATCGTC GCATTGCATG CGGCAGCAAC AGTACAGGCC
ATGCACGAAG CAGGGATCCA TACTGCGCTG AAACACTTCC CCGGCCACGG CAGTTCAACA
ACCGATACCC ATAAGGATTT CACCGACGTC ACCACCACCT GGACGCCGAA AGAACTCGAT
CCCTACAGGG CACTCATCAA AGAAGGATAC CGCGATTTCA TCATGACTGC GCATGTATTC
AACGCTCAGC TCGACCCTGA TTATCCGGCA ACACTGTCAC AGAAAACCAT CACCGGCATG
CTTCGCGACT CGCTCGGCTT CAGGGGCGCT GTCATCAGTG ACGACATGCA GATGCAGGCT
ATAGCCGCCC ATTACGGGCT CGAAACAGCT ATCAGGCTGG CTCTCGATGC TGGAGTTGAT
ATTCTGCTCT TCGCCAACAA TTCGACCTAC GACCCCGATA TTGGGAGGAA AACATTTACA
ATCATCAAAA CACTCGTCGA TAACGGCACC ATCAGCAGGA AACGGATCGA AGAATCGTGG
GAGCGGATCA ATACCATGCA ACACAACCTT TTACCGGCAG AACAATGA
 
Protein sequence
MKKKALLSLL VLLITNILSA CASNQQAEDG IDAKIGRMIM VGFRGMSIEE APWISDDIAS 
KRIGGVILFD YDVPSASTTR NIASPGQLAA LTRQLQECSP EPLLIAIDQE GGRVSRLKPS
RGFPESVSAA HLGAVNDPDS TLRSAATTAA TLQSMHINLN FAPVADVNIN PDNPVIGRLE
RSFSSDPAIV ALHAAATVQA MHEAGIHTAL KHFPGHGSST TDTHKDFTDV TTTWTPKELD
PYRALIKEGY RDFIMTAHVF NAQLDPDYPA TLSQKTITGM LRDSLGFRGA VISDDMQMQA
IAAHYGLETA IRLALDAGVD ILLFANNSTY DPDIGRKTFT IIKTLVDNGT ISRKRIEESW
ERINTMQHNL LPAEQ
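
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch of one way to do that and to compute a GC percentage for a nucleotide string. The helper names `clean_sequence` and `gc_percent` are illustrative, and only the first line of the gene sequence is used here as sample input; pasting the full gene sequence in its place would give a figure to compare against the GC content field.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGAAAAAGA AAGCACTCCT CTCACTCCTC GTTCTGCTTA TCACAAACAT ACTCTCCGCC "
seq = clean_sequence(first_line)

# Report the number of bases and the GC percentage of this sample line only.
print(len(seq), round(gc_percent(seq), 1))
```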