Gene GM21_2654 details

Gene Information

Locus tag: GM21_2654
Symbol:
ID: 8137996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3090999
End bp: 3092339
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 61%
IMG OID: 644870258
Product: acetyl-CoA carboxylase, biotin carboxylase
Protein accession: YP_003022448
Protein GI: 253701259
COG category: [I] Lipid transport and metabolism
COG ID: [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
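
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 3090999
end_bp = 3092339

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon is a stop,
# which is not counted in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa; leftover bases:", remainder)
```

Under these assumptions the computed values agree with the listed Gene Length (1341 bp) and Protein Length (446 aa).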


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.00000129379
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTCATA AAATTCTTAT CGCCAACAGG GGTGAGATCG CCCTCAGGAT CATCAGAACC 
TGCAAGGAGA TGGGGATCAA GACGGTCGCC GTGTACTCCA CGGCCGACAG CGAGTCGCTC
CATGTGAAGC TCGCCGACGA GAGCGTCTGC ATCGGCCCGG CCCCCAGCCT CTCCAGCTAC
CTCAACATCA ACGCCATCAT CTCCGCGGCG GAACTGACCG ACGCGGAGGC GATCCACCCG
GGGTACGGGT TCCTCTCCGA AAACCCGGTC TTCGCCGAGA TCTGCGAGAA GTGCGGCATC
ACCTTCATCG GACCTTCCGC CGAGAGCATG CGCATCATGG GCGACAAGAT CTCCGCCCGT
CAGGCGGTCA TCAAGGTCGG CGTCCCCATC CTTCCCGGCA CCAAGGAAGG GGTGCACGAC
GTAGCCGAGG CGATCAAGGT GGCCAAGGAG ATCGGCTTCC CGGTCATCAT CAAGGCAACG
GCTGGGGGCG GCGGACGCGG CATGAAGATC GTCCATTCCC CGGCGGCGCT TCCGAACGCC
TTCGCCACCG CGCGTGCCGA GGCGCAGTCC GGTTTCGGCA ATCCTGAGGT CTACATAGAG
CGCTACTGCG AGAGTCCGCG CCACGTCGAG ATCCAGATCC TCGCCGACAA GCACGGCAAC
GTGGTGCACC TGGGCGAGCG CGACTGCTCG ATCCAACGCC GTCACCAAAA GGTGATCGAG
GAGGCTCCCT CCACCGTCAC CACTCCGGAG CTGAGGAAAG CGATGGGCGA GGCTGCGGTC
GCCGCGGCCA AGGCCGTAAA CTACTGCAGC GTCGGCACCA TGGAATTCCT CGTCGACAAG
AACAACAACT TCTTCTTCAT GGAGATGAAC ACCCGCGTGC AGGTGGAGCA CCCGGTGACC
GAGATGGTGA CCGGCGTCGA CGTCGTGAAG GAGCAGATCC GCTCCGCATA CGGCCTCAAA
CTGCGCTACA CCCAGGACGA CATCAAGATC AAGGGACACT CCATCGAGTG CCGCATCAAC
GCGGAAGACT CGGTGAAGTT CACCCCTTGC CCGGGAAAGA TCACCGACCA CCACACACCC
GGCGGCTTAG GGGTCAGGGT CGATTCCTTC GTCTACACCA ACTACTCGGT CCTGCCGCAC
TACGACTCCC TGATCGCCAA GCTGATCGTG CATGCCGACA CCAGGGAAGA GGCGATCAAG
AGGATGGCTC GCGCGCTGGA CGAGTACATC GTGGAAGGGA TCAAGACCAC CATCCCGTTC
CACAAGAGAA TCATGGCCAA CAAAGACTTC ATCGAAGGGA ACATAGACAC CGGCTTCATC
GAAAGGCTGG TACTGGAGTA A
 
Protein sequence
MFHKILIANR GEIALRIIRT CKEMGIKTVA VYSTADSESL HVKLADESVC IGPAPSLSSY 
LNINAIISAA ELTDAEAIHP GYGFLSENPV FAEICEKCGI TFIGPSAESM RIMGDKISAR
QAVIKVGVPI LPGTKEGVHD VAEAIKVAKE IGFPVIIKAT AGGGGRGMKI VHSPAALPNA
FATARAEAQS GFGNPEVYIE RYCESPRHVE IQILADKHGN VVHLGERDCS IQRRHQKVIE
EAPSTVTTPE LRKAMGEAAV AAAKAVNYCS VGTMEFLVDK NNNFFFMEMN TRVQVEHPVT
EMVTGVDVVK EQIRSAYGLK LRYTQDDIKI KGHSIECRIN AEDSVKFTPC PGKITDHHTP
GGLGVRVDSF VYTNYSVLPH YDSLIAKLIV HADTREEAIK RMARALDEYI VEGIKTTIPF
HKRIMANKDF IEGNIDTGFI ERLVLE
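
The relationship between the two sequences can be illustrated with a short translation sketch. It assumes the gene sequence is given in the coding orientation and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in permitted start codons). The `translate` function and the constant names are hypothetical helpers written for this example, not part of any library.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence shown above (both are in frame).
first = translate("ATGTTTCATA AAATTCTTAT CGCCAACAGG")
last = translate("GAAAGGCTGG TACTGGAGTA A")
print(first)  # MFHKILIANR
print(last)   # ERLVLE
```

These two fragments match the beginning and end of the protein sequence listed above; passing the full gene sequence to the same function would translate the whole record.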