Gene Phep_3289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_3289 
Symbol 
ID8254408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp3900581 
End bp3902071 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content45% 
IMG OID644936941 
Productacetyl-CoA carboxylase, biotin carboxylase 
Protein accessionYP_003093545 
Protein GI255533173 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.191323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAG TTCTTATCGC CAATAGGGGG GAAATTGCTT TGCGTATCAT GCGTTCTGCA 
AAAGAAATGG GCATCAAAAC GGTTGCTGTA TATTCTGAGG CTGACAGGCA ATCGCTGCAT
GTGCGCTATG CGGATGAGGC AGTATGCATT GGTCCGGCAC CCTCAAATCA ATCTTACCTC
ATCGGAGAAA AAATTATTGA AGCCTGTAAA ATTACCGGGG CAGAAGCCAT TCATCCGGGC
TATGGCTTTT TATCAGAAAA CGCTGCTTTT GCGCGGCTGG TAAAAGCAGC TGGTCTGACA
TTAATAGGTC CGACACCGGA AGCCATGGAA ATCATGGGCA ATAAACTTTC TGCAAAAGCT
GCGGCATTGA AATACCAGAT TCCCATGGTT CCTGGTACTG AAGAAGCCAT TACGGATGTT
GAAGAAGCTA AACGAAGAGC AATAGAAGTA GGTTTCCCTA TCCTGATCAA AGCTGCTGCC
GGAGGTGGGG GAAAAGGAAT GCGTATTGTG GAAAAAGCTG CTGAATTTGA AGAGCAGATG
CAACTGGCCG TAAGTGAGGC CCTGTCTGCT TTTGGAGATG GTTCCGTGTT TATCGAACGT
TATGTTTCCT CTCCGCGACA CATAGAAATA CAGGTGCTGG GCGATACCCA TGGAAATATT
GTACACCTCT TTGAAAGGGA GTGTTCTGTT CAGCGCAGAC ACCAAAAAGT AATTGAAGAA
GCACCGTCAA GCATTTTAAC AGCTGAAATA AGGGAAAAGA TGGGCAAATG TGCAGTTGAT
GTGGCCCGTT CAGTAAATTA TGTAGGTGCT GGTACAGTGG AGTTTATCCT GGATGAAAAT
CTGGACTTTT TCTTCCTGGA AATGAATACC CGCTTGCAGG TAGAACACCC GGTAACTGAA
ATGATTACGG GTTTAGACCT TGTAAAAGAG CAGATAAAAA TTGCCAGGGG AGAAAAACTA
AGCTATAAGC AGGAAGATCT GCACATTAAC GGGCATGCCA TAGAACTGAG GGTATATGCT
GAAGATCCTG AAAATAATTT CCTGCCGGAT ATAGGTGTGT TGCAGACCTA TAAAACTCCT
AAGGGCAATG GCGTAAGGGT AGATGACGGG TTTGAGCAGG GAATGGAAAT CCCGATTTAT
TACGACCCGA TGATTGCCAA ACTCATTACC TATGGTAAAG ACCGGGAAGA AGCCATTGAA
CGGATGGTCC GTGCCATTGG TGAATACCAG ATCACGGGTA TTCAAACCAC ACTTGGTTTT
GGTAAATTTG TGATGCAGCA TGAAGCCTTT AAGTCTGGTA AATTCGATAC GCATTTTGTA
GCTAAATACT TTAAGGCCAA TAGCCCGAAA GTGCAAAATG AAGACGAAGC TTTATTGGCA
GCTATGATGG GAGCATTTTT TTTCAAACAG CAGCCACTTG CAGCACCGCA ACCGTTACAG
GAAGGCCAGG CCAATGCCCT CAACTGGAGA AGAAACAGGT TAAATAAATA G
 
Protein sequence
MKKVLIANRG EIALRIMRSA KEMGIKTVAV YSEADRQSLH VRYADEAVCI GPAPSNQSYL 
IGEKIIEACK ITGAEAIHPG YGFLSENAAF ARLVKAAGLT LIGPTPEAME IMGNKLSAKA
AALKYQIPMV PGTEEAITDV EEAKRRAIEV GFPILIKAAA GGGGKGMRIV EKAAEFEEQM
QLAVSEALSA FGDGSVFIER YVSSPRHIEI QVLGDTHGNI VHLFERECSV QRRHQKVIEE
APSSILTAEI REKMGKCAVD VARSVNYVGA GTVEFILDEN LDFFFLEMNT RLQVEHPVTE
MITGLDLVKE QIKIARGEKL SYKQEDLHIN GHAIELRVYA EDPENNFLPD IGVLQTYKTP
KGNGVRVDDG FEQGMEIPIY YDPMIAKLIT YGKDREEAIE RMVRAIGEYQ ITGIQTTLGF
GKFVMQHEAF KSGKFDTHFV AKYFKANSPK VQNEDEALLA AMMGAFFFKQ QPLAAPQPLQ
EGQANALNWR RNRLNK