Gene Xaut_5022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagXaut_5022 
Symbol 
ID5420503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameXanthobacter autotrophicus Py2 
KingdomBacteria 
Replicon accessionNC_009717 
Strand
Start bp241445 
End bp242491 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content69% 
IMG OID640873676 
Productcarbonic anhydrase 
Protein accessionYP_001409456 
Protein GI154243883 
COG category[R] General function prediction only 
COG ID[COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.585067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGT CTCCCTTCGT CCTGCCTTAT CACGGCATCT TGCCGGTCTA TCCGGACCTG 
CTGCGTGCGG GTCCGCGCGC GGCGCTCATT GGCCGGGTCA CGCTTGGGCC CCGCGCCGCG
CTCGGCACCC TCGCGCTTAT CCGAGGCGAC GGCCACGTTG TGGAGATCGG CGCGGACTTC
CATCTCGGCG ACTGGGGCAC GGTCCATATC GCCCACGAGA TGCATCCAGC GATTGTCGGG
GATCGGGTCA CCGTTGGACC AGACGCGGTG GTCCATGCCT GCACGGTCGG CGATGACTGC
GTGATCGAGG AGGATGCCAT CATCCTTGAC GGATCCGTCC TGGAGAACGG CGTCGTCATG
GAGGCGGGCA CCATCGCCTT TCCCCGGTCG AGGCTCGAGG CGGACACCCT CTACGCCGGC
GCGCCGGCAA AGCCCGTGCG CAGGATCGAT GCGGCCGAAC GCGCTGCGCG GGCGGCGCGC
ATCCGCGCCC TCGCCAGCGC GCCTCCGCCT CCGCCGCCGG CCGGCGGCGC CGCACGGCTC
GATATTCACT CCACCACTTT CATCGCCGCC AACTCCTCCG TCTCAGGCGC CTTCAAGGCG
GATGCGCATG CGTCGGTGCT GTTCAGCTGC ACCCTCGACG CGCGCAACTC AGAAATCTCG
CTGGGCGAGA ATTCCAACAT CCAGGACAAC AGCTTGGTCC GCTGTCCCGA CGGCCCCGTC
GTCGTCGCCG CCAATGCCGT GGTCGGGCAC AACGTGGTGC TGGAGAGCTG CACCGTCGGC
ACCGGCTCGC TGGTGGGCAC CGGCAGCCGC GTGGCGCCCG GCACCGTCGT GGAGCCGGAC
GTCCTGCTCG CGGCCGGAGC CCGCACGCAG AGGGGCCAGG TGCTGGAAAG CGGCTTCCTG
TGGGGCGGAA ATCCAGCGCG CAGGATCGCG CCGCTCGACG ACAAGAAGCG CCAGATGATC
CCCTGGATCA TCTCCACCTA TTGCGAGTAC ACCGCCGACT TCCTCGCGAG CCAGCATCGC
GCGGCTCAGC GGGCCAGCTT CGGCTAA
 
Protein sequence
MSASPFVLPY HGILPVYPDL LRAGPRAALI GRVTLGPRAA LGTLALIRGD GHVVEIGADF 
HLGDWGTVHI AHEMHPAIVG DRVTVGPDAV VHACTVGDDC VIEEDAIILD GSVLENGVVM
EAGTIAFPRS RLEADTLYAG APAKPVRRID AAERAARAAR IRALASAPPP PPPAGGAARL
DIHSTTFIAA NSSVSGAFKA DAHASVLFSC TLDARNSEIS LGENSNIQDN SLVRCPDGPV
VVAANAVVGH NVVLESCTVG TGSLVGTGSR VAPGTVVEPD VLLAAGARTQ RGQVLESGFL
WGGNPARRIA PLDDKKRQMI PWIISTYCEY TADFLASQHR AAQRASFG