Gene Daro_3922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3922 
Symbol 
ID3567653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4218001 
End bp4219446 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content60% 
IMG OID637682396 
Productcarbonic anhydrase 
Protein accessionYP_287120 
Protein GI71909533 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3338] Carbonic anhydrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCACC TGATCATCGC CAGCCTGCTG GCCGCCCTGC CTTGGGCTGC TTCGGCTGCC 
CCAACCTGGC AAACCATCTC GTCAGAACCG GGCAAGCGCA TCGAGATCGA CCGCACCAGC
CTGAAGCGCG AAGGGAGCAC CGTGCAGGCT CAAGGGCGCA TCGTCCTTGA AAAAGAGCTG
ACTGACGCAA AATCAGGCGC CGGCTACCGG GTCATCGAAG CAATTACCCG CTACGACTGC
AACACGCGCA ACGCCAACAC GATCAAACGC ATTTTCAAGA AAAACGAAAA CGAAGTCATC
CGCGAGGAAG AAATCAAGGG CTCCGACCTC CCGGTACGCA CCGGCACGCT GGACGACAAG
GTATTGCGTG AAGTCTGCCG CCCGCCGAAG GAAAGCCCGG CAGAACTGGC CAAAAAAGCC
AATGAAGCAG CTGGCGAACT GAAGGCTGCC AACGACGCGC TGCTCAAGAA GGAAATGGCC
AAGGCCGAAA AGCCGGCAAC CATCAAGGCC AGCGATGTGC CGGACAAGGA AGCGGAACAC
GGCGCCATTC CCTCGATCCG CCCAAACCTG AAGGCAGCAA CGGAAAGCGC CAAGGAGACG
GCACCAGCCC CAACGCCGGC AGCTGCGCCG GCCAAAGCAG TGGCCCCGGC AAAAGCGGCG
ACCGTCGTCG TGCACACCAC CCCAGCCCCA GCGCCCAAAG CCAGGAAGCC AGCCAGGTCT
GAAGGCTATA TGCTGGAATT GACTCATTCC GAACCTGCCG CACAGCACGC CCAAATTCAC
TGGGCCTACG ATGGTGCCGG CGCCCCGGAA AACTGGCCCA ATCTCGACCC GAAGAACAAG
GTGTGCGCGA TCGGCGAGCG CCAGTCACCA ATCGACATCA AGGACGGCAT CAAGGTCGAC
CTGGAGCCGA TCAAGTTCAA GTACCAGCCC TCTACCTTCA GGATCGTCGA CAACGGCCAT
ACCGTGCAGG TTGAAGTCGG CGATGGCTCG ATTTCTCTGA CCGGCAAAAC CTATGAACTG
GTCCAGTTCC ACTTCCATCG CCCGTCTGAA GAAAAGGTAA ACGGCCAGCG CTTCGACATG
GTCGTCCATC TGGTGCACAA GTCGGATGAC GGGCAACTCG CTGTTGTCGC CGTGCTGCTC
GAACGTGGTA CCGAGAACCC CTTCATCCAG ACGCTGTGGA ACAACATGCC ACTGGAAAAG
AACATGGCCG TTGCCCCTCC GACGACTACC ATCGATCTGA ACACCCTGCT ACCAGCTACC
CGCAACTACT ACACCTACAT GGGCTCGCTG ACCACGCCAC CGTGCTCCGA AGGGGTGCTG
TGGCTGGTCA TGAAACAACC GGTGCAAGTT TCGCAGGATC AGATCAACAT TTTCAGCCGC
CTGTACAAAA ACAACGCCCG GCCGATCCAG CCCTCCGGCG GACGCCTGAT CAAGGAAGGC
CGTTGA
 
Protein sequence
MRHLIIASLL AALPWAASAA PTWQTISSEP GKRIEIDRTS LKREGSTVQA QGRIVLEKEL 
TDAKSGAGYR VIEAITRYDC NTRNANTIKR IFKKNENEVI REEEIKGSDL PVRTGTLDDK
VLREVCRPPK ESPAELAKKA NEAAGELKAA NDALLKKEMA KAEKPATIKA SDVPDKEAEH
GAIPSIRPNL KAATESAKET APAPTPAAAP AKAVAPAKAA TVVVHTTPAP APKARKPARS
EGYMLELTHS EPAAQHAQIH WAYDGAGAPE NWPNLDPKNK VCAIGERQSP IDIKDGIKVD
LEPIKFKYQP STFRIVDNGH TVQVEVGDGS ISLTGKTYEL VQFHFHRPSE EKVNGQRFDM
VVHLVHKSDD GQLAVVAVLL ERGTENPFIQ TLWNNMPLEK NMAVAPPTTT IDLNTLLPAT
RNYYTYMGSL TTPPCSEGVL WLVMKQPVQV SQDQINIFSR LYKNNARPIQ PSGGRLIKEG
R