Gene Acid345_2780 details


Gene Information

Locus tag: Acid345_2780
Symbol:
ID: 4072403
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3293229
End bp: 3294398
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 61%
IMG OID: 637984798
Product: N-acetylglucosamine 6-phosphate deacetylase
Protein accession: YP_591855
Protein GI: 94969807
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
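
The gene and protein lengths in this record can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it agrees with the listed 1170 bp) and that the last codon is a stop codon that does not count toward the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3293229
end_bp = 3294398

# Assumption: 1-based, inclusive coordinates, so the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is assumed to be a stop.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1170
print(protein_length)   # 389
```

Both values agree with the Gene Length and Protein Length fields of the record.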


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.382216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACTG CCCTCCTCGC TCGCGAGATC CTGACGCCGC TCGATCGCAT TCATAACGGC 
ATCCTCATCT TCGAAGACGG CTGCATTCTG GAAGTCGGCA ATCGCGATTG CATCGAGGTC
CCCCGCGCCT GCCGCACCAT CGATCTGGGC GACGCAATCC TCACTCCCGG ATTCATTGAT
CTTCACATCC ACGGCGGTGC CGGGCACGAC GTGATGGAAG GCGACGATGC CGCACTCGAA
GCCGTCGAAC TTCTCATCGC GAAGCACGGC GTCACCAGCT ACTGCCCAAC CACGGTTACG
GCAGCAACCG ACGTGACCCT CGTTTCACTC AATAAAATCG GGCACTTCAT CGAGCGCATG
GCTTCGCACG GTCCCGCCAA CAACGGACGC GCGCGCCCCC TCGGCGTGCA CCTCGAAGGC
CCCTTCCTCG CCGAGTCGCG ACGCGGCGTG CATCCGCCGA ACCATCTGCA AGCGCCGTCC
ATCAAGCTCT TTCACGAGAT GTGGCAAGCC GCCATCGGCC GCGTGAAAGT GCTGACCATC
GCACCGGAAT TACCAGGCGC CATCGAGTTG ATTCACGAAG CACGCAAGCG CGGCGTAGTG
GTGAGCCTCG GTCATTCCAA CGCCGATCTC TGCGAAGCCA AGCGCGGCAT CAGCGCCGGC
GGACATCACG CGACGCACAC CTTCAACGCC ATGCGCCCAC TCCAGCACCG CGACGCCGGC
CTACTCGGCG CCATCCTCAC CCAGCAATGC GTCACCGCCG ACATCATCGT CGATGGCATT
CACGTGGATC CCACGGTAGT GAAGTTGTTC CTGCGCGCGA AGGGTGTGGA GGGCGCAGTG
CTCATCACCG ATGCAACCAG TGCGACCGGC ATGCCCGATG GTACGTATCA CCTTGGCAAT
ATCGAAGTGG AAGTGAAGGA CGGGCAGTGC ATCTCGCAAG GCAAACTTGC CGGCAGTGTT
CTCACACTCG ATCGCGCCGT GCGCAACGTG ATGGACTTCG CCGGTTGGAC GCTGCAGAAT
TCAGTACGCC TCGCGACCTA CAATCCCGCC CGCGTGCTCG GCGTGGAAAA CAGCAAAGGC
GTTTTAAAAG CCGGCGCCGA CGCCGACATC CTGGTGATGA ACGCCGCTGG CGAAATCCGA
AATACGATCA TCGGTGGTAT TGGGATCTAG
 
Protein sequence
MKTALLAREI LTPLDRIHNG ILIFEDGCIL EVGNRDCIEV PRACRTIDLG DAILTPGFID 
LHIHGGAGHD VMEGDDAALE AVELLIAKHG VTSYCPTTVT AATDVTLVSL NKIGHFIERM
ASHGPANNGR ARPLGVHLEG PFLAESRRGV HPPNHLQAPS IKLFHEMWQA AIGRVKVLTI
APELPGAIEL IHEARKRGVV VSLGHSNADL CEAKRGISAG GHHATHTFNA MRPLQHRDAG
LLGAILTQQC VTADIIVDGI HVDPTVVKLF LRAKGVEGAV LITDATSATG MPDGTYHLGN
IEVEVKDGQC ISQGKLAGSV LTLDRAVRNV MDFAGWTLQN SVRLATYNPA RVLGVENSKG
VLKAGADADI LVMNAAGEIR NTIIGGIGI
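
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record: it builds the amino-acid assignments shared by NCBI translation tables 1 and 11, strips the spacing used in the 10-base display blocks, and translates until the first stop codon. It does not handle the alternative start codons that table 11 allows, which does not matter here because the gene starts with ATG. The helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest; this amino-acid
# string is the assignment shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display row of the gene sequence above.
first_row = "ATGAAGACTG CCCTCCTCGC TCGCGAGATC CTGACGCCGC TCGATCGCAT TCATAACGGC"
print(translate(first_row))  # MKTALLAREILTPLDRIHNG
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_content` gives the value that the record rounds to a percentage.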