Gene Acid345_3287 details


Gene Information

Locus tag: Acid345_3287
Symbol:
ID: 4072699
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3892983
End bp: 3894338
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 60%
IMG OID: 637985308
Product: acetyl-CoA carboxylase, biotin carboxylase
Protein accession: YP_592362
Protein GI: 94970314
COG category: [I] Lipid transport and metabolism
COG ID: [COG0439] Biotin carboxylase
TIGRFAM ID: [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit
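
The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon (neither is stated in the record), the arithmetic can be checked as follows. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3892983
end_bp = 3894338

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its last codon is a stop codon,
# which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1356
print(protein_length)  # 451
```

Both results agree with the Gene Length and Protein Length fields.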


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.195877
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCGAA AGATATTAAT TGCTAACCGC GGCGAAATCG CGCTGCGGGT GATTTGCGCC 
TGCAAGGAAC TCGGCATCAA GACGGTCGCC ATCTACAGCG AGGCCGACCG GAATTCCTTG
CACGTGCGCT TTGCCGACGA AGCCATCTGC ATCGGTCCGC CGCGCCTGGC GGACAGCTAC
CTGAATATCC CTGCCGTGAT CAGCGCCGCT GAAATCGCGA ACGTGGATGC GATCCATCCC
GGCTACGGCC TGCTCAGCGA AAACGCGAAC TTCGCGGAAG TGTGCCAGGC CTGCGGCATC
GAGTTCATCG GCCCGAAGCC AGAGACCATT CGCCTGATGG GCGAAAAGGA AAAAGCTCGC
GCTGCGATGA AGGAGCATGG GGTGCCGATC CTGCCCGGCT CCGATGGTGT GGTGGCGACC
GAAGCCGAGG CGATGGAGTG GGCGAAGAAG ATCGGCTTCC CGGTGATTGT GAAGGCAAAA
GCGGGAGGCG GTGGACGCGG CATGCGCGTT ATCCGCAGCG AAGAAGAACT TCCTGTCCAT
TTTGCTGCCG CAAGTTCAGA AGCAGCGTCG GCGTTCGGCA ATGGCGATCT CTACATGGAA
AAGTTTGTCG AGCGCCCGCG CCACATCGAG TTCCAGATAC TCGCCGATTC GTACGGCAAT
GTGGTTTCGC TCGGCGAGCG CGAGTGTTCC ATCCAGCGGC GTCACCAAAA GCTGTTGGAG
GAAGCGCCAA GCACGCAGGT CACACCGGAG TTGCGCGCGG AAATCGGCGG CATCCTCGAG
AAGACGCTGT CGAAGATCGG TTACATCAAC GCGGGAACCA TCGAGTTCCT GATGGATGAA
GACCGAAAGC TGTACTTCAT CGAGATGAAC ACCCGCATCC AGGTCGAACA CCCGGTCACC
GAGATGACAA CGGACGTGGA CCTGGTGAAA GGGCAGATCA TGATCGCCGC GGGCGCGAAG
CTCCAGGACG TTCTGATGGG GCCGATCGTC TTCCGCGGGC ACGCCATCGA GTGCCGCATC
AACGCGGAGC ATCCGGAGAA GTTCACGCCA TCTGCCGGAA AGATTACGGC GTTCCATACG
CCGGGCGGCA CGGGCGTGCG CGTCGATACG CACCAGTACG CCGAGGGCGT CATTCCGCCG
TATTACGATT CGCTCATCGC AAAGCTGGTG GTGCGCGGAA AAGACCGCGA CGAAGCGATC
TCACGCATGG CGCGCGCGCT GGAGATGTTC ATCGTCGAGG GCATTCACAC CTCGATCCCG
CTGCACCGCA AGATCATGGC TGACCCAGAT TTCCGCGCCG GCAACTTCGA TACGAAGTTC
ATGGAGCGCT TCATGACGCA GACCCACAAA AAGTAA
 
Protein sequence
MFRKILIANR GEIALRVICA CKELGIKTVA IYSEADRNSL HVRFADEAIC IGPPRLADSY 
LNIPAVISAA EIANVDAIHP GYGLLSENAN FAEVCQACGI EFIGPKPETI RLMGEKEKAR
AAMKEHGVPI LPGSDGVVAT EAEAMEWAKK IGFPVIVKAK AGGGGRGMRV IRSEEELPVH
FAAASSEAAS AFGNGDLYME KFVERPRHIE FQILADSYGN VVSLGERECS IQRRHQKLLE
EAPSTQVTPE LRAEIGGILE KTLSKIGYIN AGTIEFLMDE DRKLYFIEMN TRIQVEHPVT
EMTTDVDLVK GQIMIAAGAK LQDVLMGPIV FRGHAIECRI NAEHPEKFTP SAGKITAFHT
PGGTGVRVDT HQYAEGVIPP YYDSLIAKLV VRGKDRDEAI SRMARALEMF IVEGIHTSIP
LHRKIMADPD FRAGNFDTKF MERFMTQTHK K
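
The gene and protein sequences above are related through translation table 11. The following is a rough sketch of how the two can be cross-checked and how a GC percentage could be computed. It is not the database's own method: the function names `clean`, `translate`, and `gc_percent` are hypothetical helpers, and the codon table is the standard amino acid assignment that table 11 uses for elongation codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies
# fastest). Table 11 shares these amino acid assignments with the standard
# code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate frame 1 codon by codon, stopping at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG or TTG) is not
    converted to M here. This gene starts with ATG, so it is unaffected.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTTTCGAA AGATATTAAT TGCTAACCGC"))  # MFRKILIANR

# Last row of the gene sequence (in frame, ends with the TAA stop codon).
print(translate("ATGGAGCGCT TCATGACGCA GACCCACAAA AAGTAA"))  # MERFMTQTHKK
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the Protein sequence block and the GC content field.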