Gene Saro_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2079 
Symbol 
ID3917727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2217156 
End bp2218502 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content65% 
IMG OID640444832 
Productbiotin carboxylase / acetyl-coenzyme A carboxylase carboxyl transferase subunit alpha 
Protein accessionYP_497352 
Protein GI87200095 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.252313 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCATCA AGCGCCTGCT GATCGCCAAT CGCGGCGAGA TCGCGCTGCG CATCCATCGC 
GCGGCTCACG AAATGGGCAT CGAGACGGTC GCGGTGCACT CCACCGCCGA TGCCGATGCG
ATGCACGTGC GCCTTGCCGA CCATGCGGTC TGCATCGGCC CGCCCGCCGC GAAGGACAGC
TATCTCAACG TCGCCGCGAT CATCTCGGCT GCCGAGATCA CCCATGCCGA CGCGATCCAT
CCGGGCTACG GCTTCCTTTC GGAAAACGCC AAGTTCGCGG AAATCGTCGA AGCGCACGGT
ATCACCTGGG TCGGTCCCAA GCCAGAACAC ATCCGCACGA TGGGCGACAA GGTCGAGGCC
AAGCGCACCG CGGGCGCGCT CGGCCTGCCG CTGGTCCCCG GTTCCGACGG CGCGGTGTCC
GAAATCGACG AGGCCAAGAA GATCGCCGAT GCCATCGGCT ATCCGGTGAT CATCAAGGCA
GCCTCGGGCG GCGGCGGTCG CGGCATGAAG GTTTGCAACA GCGCCGACCA GCTCGAAACG
CTGATGCAGC AAGCCGGCAG CGAGGCGAAG GCCGCGTTCG GCGATGCCAC CGTCTATATC
GAGAAGTATC TCGGCAACCC GCGCCACATC GAATTCCAGA TCTTCGGTGA CGGCAACGGC
AACGCGATCC ACCTGGGCGA GCGCGACTGC TCGCTCCAGC GCCGCCACCA GAAGGTGCTC
GAGGAAGCGC CCTCGCCCGT CATCTCGGCC GACGAACGTG CGCGCATGGG CGGCATCGTC
GCCAAGGCCA TGGCCGACAT GGGGTATCGC GGCGCGGGCA CGATCGAGTT CCTGTGGGAG
AACGGCGAGT TCTACTTCAT CGAGATGAAC ACCCGCCTTC AGGTGGAACA TCCGGTGACC
GAGGCGATCA CCGGCGTCGA CCTGGTGCGC GAACAGATCC GCATTGCCGA TGGCAAGCCG
CTTTCGGTCA CGCAGGACGA GATCGAGTTC AAGGGACACG CGATCGAGTG CCGCATCAAT
GCGGAAGACC CGTTCACATT TGCCCCCTCG CCGGGACTGG TGAAGAGCTA TCACGCAGCG
GGCGGCATGC ACGTGCGCGT CGATTCAGGT CTCTACGCCG GGTACAAGAT CCCGCCGTAC
TATGACTCGA TGATTGCCAA GCTGATCGTC TACGGCCGGA CCCGCGAAGG CTGCATCATG
CGGCTGAAGC GCGCGCTCGA GGAAATGGTG ATCGAAGGCC CCAAGACCTC GATCCCGCTC
CACCAGCGCC TGCTGAGCCA GCCCGACTTC CTCAGCGGCG ACTACACGAT CAAGTGGCTC
GAGGAATGGC TGGCCAAGGA CGCCTGA
 
Protein sequence
MAIKRLLIAN RGEIALRIHR AAHEMGIETV AVHSTADADA MHVRLADHAV CIGPPAAKDS 
YLNVAAIISA AEITHADAIH PGYGFLSENA KFAEIVEAHG ITWVGPKPEH IRTMGDKVEA
KRTAGALGLP LVPGSDGAVS EIDEAKKIAD AIGYPVIIKA ASGGGGRGMK VCNSADQLET
LMQQAGSEAK AAFGDATVYI EKYLGNPRHI EFQIFGDGNG NAIHLGERDC SLQRRHQKVL
EEAPSPVISA DERARMGGIV AKAMADMGYR GAGTIEFLWE NGEFYFIEMN TRLQVEHPVT
EAITGVDLVR EQIRIADGKP LSVTQDEIEF KGHAIECRIN AEDPFTFAPS PGLVKSYHAA
GGMHVRVDSG LYAGYKIPPY YDSMIAKLIV YGRTREGCIM RLKRALEEMV IEGPKTSIPL
HQRLLSQPDF LSGDYTIKWL EEWLAKDA