Gene GSU2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2019 
SymbolaccC 
ID2688045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2212199 
End bp2213539 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content57% 
IMG OID637126710 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionNP_953068 
Protein GI39997117 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.274625 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCATA AAGTTCTGAT TGCAAATCGT GGCGAAATCG CCCTGAGGGT CATCCGGGCC 
TGCAAAGAGC TGGGAATCAA GACCGTTGCC GTCTACTCGA CAGCGGACAG GGATTCGCTC
CATGTGAAGC TCGCGGATGA GAGTGTCTGC ATCGGTCCCG CGCCGAGCCT GCAGAGCTAT
CTTAATATTA ACGCCATCAT TTCGGCTGCC GAATTGACCG ATGCCGAGGC AATCCACCCC
GGCTACGGTT TTCTGTCGGA AAATGCCGCT TTTGCCGAAA TCTGTGAAAA CTGCGGTATA
ACCTTCATCG GCCCCTCTTC ACAAAGCATG CGCATCATGG GCGACAAGAT CAGCGCCCGG
CAGGCGGTGA TCAAGGAAAA TGTGCCGATC CTGCCCGGCA CCAAGGAGGG GGTTAACGAC
GTCAACGAGG CGGTGAAGAT CGCCAAGAAG ATCGGCTTCC CCGTCATCAT CAAGGCGACT
GCCGGAGGCG GTGGCCGGGG GATGAAGATC GTGCACTCCC CGGCGGCCCT CCCCAACGCC
TTTGCCACGG CTCGCGCCGA GGCTCAGGCC GGTTTCGGCA ACCCTGAGGT CTACATTGAG
AAGTATTGCG AGAAGCCGCG CCACGTTGAG ATCCAGGTCA TGGCCGACAA GCACGGTAAC
GTGATTCACC TGGGTGAGCG GGACTGCTCC ATCCAGCGTC GCCACCAGAA GATCATCGAG
GAGTCGCCGT GCCCGGTCAT GACTCCTGCA CTCCGCAAGG CCATGGGTGA TGCGGCTGTT
CGCGCGTCCA AGGCAGTGGG GTACGACAGT GTCGGCACCG TTGAGTTCCT GGTGGACAAG
GACCTCAACT TCTATTTCAT GGAAATGAAT ACCCGGGTGC AGGTTGAGCA TCCGGTGACC
GAAATGGTGA CCGGCATCGA CATCGTCCGG GAGCAGATCC GTTCTGCAGC CGGTCTCAAG
CTTCGTTACA AGCAAAGCGA CATTAAACTG CACGGTCACG CTATTGAATG CCGTATCAAT
GCTGAAGATC CGGTGAAGTT CACCCCGTCG CCGGGCAAGA TCGTCGGTTA CCATACCCCG
GGAGGTCTGG GTGTGCGGAT CGATTCTTTC GTCTATGATC AGTATTCCGT GGTCCCCCAC
TACGACTCGC TCATAGCGAA GCTGATCGTC CACGCAGAGA CCAGGGAAGA CGCCATCCGC
CGCATGGCCC GCGCCCTTGA CGAGTACATC ATTGAGGGCA TCAAGACCAC AATCCCCTTC
CATAAGAGGA TCATGGACAA CAAAGACTTT ATGGAGGGGA ATGTCGACAC CGGCTTCCTC
GAGCGAATCG TGCTGGAGTA G
 
Protein sequence
MFHKVLIANR GEIALRVIRA CKELGIKTVA VYSTADRDSL HVKLADESVC IGPAPSLQSY 
LNINAIISAA ELTDAEAIHP GYGFLSENAA FAEICENCGI TFIGPSSQSM RIMGDKISAR
QAVIKENVPI LPGTKEGVND VNEAVKIAKK IGFPVIIKAT AGGGGRGMKI VHSPAALPNA
FATARAEAQA GFGNPEVYIE KYCEKPRHVE IQVMADKHGN VIHLGERDCS IQRRHQKIIE
ESPCPVMTPA LRKAMGDAAV RASKAVGYDS VGTVEFLVDK DLNFYFMEMN TRVQVEHPVT
EMVTGIDIVR EQIRSAAGLK LRYKQSDIKL HGHAIECRIN AEDPVKFTPS PGKIVGYHTP
GGLGVRIDSF VYDQYSVVPH YDSLIAKLIV HAETREDAIR RMARALDEYI IEGIKTTIPF
HKRIMDNKDF MEGNVDTGFL ERIVLE