Gene Ssed_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSsed_4090 
Symbol 
ID5613660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sediminis HAW-EB3 
KingdomBacteria 
Replicon accessionNC_009831 
Strand
Start bp5003935 
End bp5005305 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content50% 
IMG OID640935045 
Productacetyl CoA carboxylase, biotin carboxylase subunit 
Protein accessionYP_001475822 
Protein GI157377222 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGC CAAGTGAGCG ACCAATAAAT CGCGTGCTTA TCGCCAACCG AGGTGAGATA 
GCACTCAGAA TACAGCGTGC CTGCGCCGAA CTAGGCATAG AAACGGTGGC CATTCATTCA
ACGGCCGATC GCCAGCAACT ACATCTGGAA TATGCCAGCC AAACACTCTG TATTGGGAAA
GCGCCGGCAT TAAACAGTTA TCTCAATATT ACCGGTATTA TTTCGGCGGC ATCGCTTTCA
AAAACAGATG CGATACATCC GGGGTACGGA TTCCTGTCAG AAAATGCCGA CTTTGCCGAA
CAAGTTGAAA GTAGCGGCTT TGTTTTCATC GGTCCAACCC CTGAAGTGAT TCGTCTAATG
GGTGACAAGA TCTCAGCCAT TGCGGCAATG AAGCAAGCCG GAGTGCCCAC CGTTCCGGGC
TCCGACGGAA GCTTAACCGA CAATCATCAG CATAATAAAA AGCTTGCAGA GCAGATAGGT
TACCCCGTAA TCATAAAAGC CACCGCCGGT GGTGGTGGAC GAGGCATGAG GGTCATAAAG
GGGGAAGCTG AGCTCATTCA GGCTATCGAA CTAACCCGGG CCGAGGCCTT GGCCGCATTT
GCCAATGACT CGCTATATAT GGAGAAGTAT CTTGAAACGC CTCGCCATAT TGAAATTCAG
GTGCTGTCTG ATGGCCAGGG AAACGCTATC CACTTAGGTG AGCGCGACTG CTCGATGCAG
CGAAAACATC AAAAGGTCAT CGAGGAGGCC CCTGCCCTAG GTATCGATGA GCAAACACGC
CATACCATAG GTGAGCTCTG TGCTAAGGCC TGCATAGAGA TAGGTTATCG CGGCGTTGGT
ACCTTCGAGT TTCTCTATGA AGCAAACCGG TTCTACTTTA TTGAGATGAA CACACGCATT
CAGGTAGAGC ATCCCATCAC TGAGATGATC ACCGATGTCG ATATTATCAA GGCGCAGCTG
GAAATTGCCT CAGGTTTACC TCTCAAACTT AAGCAGGAAG ATGTCCCCCT CAACGGTCAC
GCCATCGAGT GCCGGATCAA TGCAGAAGAT CCCGTTAACT TTGTCCCCTC GCCAGGCCTC
ATTACTCAGT TTCACGCCCC CGGGGGAATT GGTGTTCGCT GGGACTCTCA CCTCTATCAG
GGTTACAGCG TCCCTCCCTA TTATGATTCC ATGATTGGAA AACTGATCAC CTGGGGAGAG
GACAGAGCGA CCGCCATCGC ACGCATGCAG TTGGCGCTCA ATGAGCTAAA AATTGAGGGC
ATAAAAACCA ATATCCCTCT GTTAAAGAAG ATACTGGCTG ATGAAGGCTT CAGGCAGGGC
GGACAATCGA TACATTATCT TGAGAAAGAG ATCCTAGAAC CTAGGAACTA G
 
Protein sequence
MKLPSERPIN RVLIANRGEI ALRIQRACAE LGIETVAIHS TADRQQLHLE YASQTLCIGK 
APALNSYLNI TGIISAASLS KTDAIHPGYG FLSENADFAE QVESSGFVFI GPTPEVIRLM
GDKISAIAAM KQAGVPTVPG SDGSLTDNHQ HNKKLAEQIG YPVIIKATAG GGGRGMRVIK
GEAELIQAIE LTRAEALAAF ANDSLYMEKY LETPRHIEIQ VLSDGQGNAI HLGERDCSMQ
RKHQKVIEEA PALGIDEQTR HTIGELCAKA CIEIGYRGVG TFEFLYEANR FYFIEMNTRI
QVEHPITEMI TDVDIIKAQL EIASGLPLKL KQEDVPLNGH AIECRINAED PVNFVPSPGL
ITQFHAPGGI GVRWDSHLYQ GYSVPPYYDS MIGKLITWGE DRATAIARMQ LALNELKIEG
IKTNIPLLKK ILADEGFRQG GQSIHYLEKE ILEPRN