Gene SNSL254_A3643 details


Gene Information

Locus tag: SNSL254_A3643
Symbol: accC
ID: 6482739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3532617
End bp: 3533966
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 55%
IMG OID: 642738918
Product: acetyl-CoA carboxylase biotin carboxylase subunit
Protein accession: YP_002042635
Protein GI: 194444408
COG category: [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
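
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3532617
end_bp = 3533966

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)  # prints: 1350 0 449
```

Under these assumptions the result matches the reported Gene Length (1350 bp) and Protein Length (449 aa).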


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.4794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.0000582777
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTGGATA AAATTGTCAT CGCCAACCGC GGCGAGATCG CACTACGTAT TCTTCGAGCC 
TGTAAAGAAC TCGGCATCAA GACCGTCGCT GTGCACTCAA GCGCGGATCG CGATTTAAAA
CACGTATTGC TGGCGGATGA GACGGTCTGT ATTGGTCCGG CACCGTCCGT AAAAAGTTAT
CTGAACATCC CGGCTATCAT TAGCGCCGCT GAAATCACCG GCGCGGTGGC AATCCACCCG
GGTTACGGCT TCCTTTCTGA GAACGCCAAT TTTGCCGAGC AGGTTGAACG CTCCGGCTTT
ATCTTTATCG GCCCGAAAGC GGACACCATC CGCCTGATGG GCGATAAAGT GTCCGCGATT
ACCGCGATGA AAAAAGCGGG CGTGCCGACC GTACCAGGAT CTGACGGCCC GCTGGGCGAC
GATATGAATG CGAACCGCGC TCATGCCAAA CGTATTGGCT ATCCGGTGAT CATCAAAGCG
TCCGGCGGCG GCGGCGGCCG CGGTATGCGC GTGGTTCGTA GCGATGCTGA ACTGGCGCAG
TCCATCTCCA TGACCAAAGC GGAAGCGAAA GCGGCTTTCA GCAACGACAT GGTATACATG
GAAAAATACC TGGAAAATCC ACGCCACATC GAAATTCAGG TGCTGGCGGA CGGCCAGGGC
AACGCTATCT ATCTGGCGGA ACGCGACTGT TCCATGCAGC GTCGCCACCA GAAAGTGGTT
GAAGAAGCCC CGGCGCCAGG CATTACGCCG GAACTGCGTC GCTATATCGG CGAACGCTGC
GCGAAAGCGT GCGTAGACAT CGGCTATCGC GGCGCAGGGA CGTTCGAATT CCTGTTCGAA
AACGGCGAGT TCTATTTCAT CGAAATGAAC ACCCGTATTC AGGTTGAACA CCCGGTGACT
GAAATGATTA CTGGCGTCGA TTTGATCAAA GAGCAGTTGC GCATCGCGGC GGGTCAGCCG
CTGTCGATCA CACAGGACGA AGTTGTCGTT CGAGGCCATG CGGTAGAATG CCGTATCAAT
GCCGAAGATC CGAACACCTT CCTGCCAAGC CCAGGCAAAA TCACGCGCTT CCATGCGCCT
GGCGGCTTTG GCGTTCGCTG GGAATCTCAT ATCTACGCGG GCTACACGGT GCCGCCGTAC
TATGATTCCA TGATCGGCAA ACTCATCTGC TACGGTGAAA ACCGCGACGT GGCGATTGCC
CGTATGAAAA ATGCCCTGCA GGAACTGATT ATCGATGGTA TCAAAACCAA TATCGATCTG
CAGACCCGCA TCATGAATGA CGAGCACTTC CAGCACGGTG GAACCAACAT CCACTATCTG
GAGAAAAAAC TCGGTCTTCA GGAAAAGTAA
 
Protein sequence
MLDKIVIANR GEIALRILRA CKELGIKTVA VHSSADRDLK HVLLADETVC IGPAPSVKSY 
LNIPAIISAA EITGAVAIHP GYGFLSENAN FAEQVERSGF IFIGPKADTI RLMGDKVSAI
TAMKKAGVPT VPGSDGPLGD DMNANRAHAK RIGYPVIIKA SGGGGGRGMR VVRSDAELAQ
SISMTKAEAK AAFSNDMVYM EKYLENPRHI EIQVLADGQG NAIYLAERDC SMQRRHQKVV
EEAPAPGITP ELRRYIGERC AKACVDIGYR GAGTFEFLFE NGEFYFIEMN TRIQVEHPVT
EMITGVDLIK EQLRIAAGQP LSITQDEVVV RGHAVECRIN AEDPNTFLPS PGKITRFHAP
GGFGVRWESH IYAGYTVPPY YDSMIGKLIC YGENRDVAIA RMKNALQELI IDGIKTNIDL
QTRIMNDEHF QHGGTNIHYL EKKLGLQEK
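
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the tool that produced this record; the names `clean` and `translate` are hypothetical helpers. The codon table is built from the standard amino-acid string, whose elongation codon assignments are the same in translation table 11; the alternative start codons that table 11 allows are not handled here, which is harmless for this gene because it begins with ATG. Only the first 60 bp line of the gene sequence is used as input.

```python
from itertools import product

# Standard codon assignments with bases in T, C, A, G order.
# Assumption: table 11 differs from the standard table only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record (60 bp, 20 codons).
first_line = "ATGTTGGATA AAATTGTCAT CGCCAACCGC GGCGAGATCG CACTACGTAT TCTTCGAGCC"
print(translate(first_line))  # prints: MLDKIVIANRGEIALRILRA
```

The output corresponds to the first two 10-residue blocks of the protein sequence listed above (MLDKIVIANR GEIALRILRA).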