Gene TM1040_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1518 
Symbol 
ID4077074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1624175 
End bp1625524 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content61% 
IMG OID638006831 
Productacetyl-CoA carboxylase biotin carboxylase subunit 
Protein accessionYP_613513 
Protein GI99081359 
COG category[I] Lipid transport and metabolism 
COG ID[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0122051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.737322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGACA AAATCTTAAT TGCCAACCGG GGTGAGATCG CGCTGCGCGT GATCCGTGCC 
TGCCGCGAGA TGGGCATCCA ATCCGTTGCG GTGCACTCCA CCGCCGACGC CGATGCCATG
CATGTGCGCA TGGCCGATGA GTCGATCTGC ATTGGCCCAC CGTCAGGCGC CCAGAGCTAT
CTGTCGATCC CCGCGATCAT CTCCGCCTGC GAAATTTCGG GCGCTCAGGC GATCCACCCG
GGCTACGGGT TTCTCTCCGA GAACGCCAAT TTTGTGCAGA TCGTCGAGGA CCACGGGCTG
ACCTTCATCG GCCCCTCCGC AGAACATATT CGCCAGATGG GCGACAAGAT CACCGCAAAA
GACACCGCAA AAGCGCTCGG CATTCCTTGT GTGCCGGGAT CTGACGGCGG TGTGCCCGAT
GTGGAGACCG CAAAAAAGGT CGCCGCCGAA ATGGGCTATC CGGTGATCAT CAAGGCCACC
GCCGGTGGCG GCGGGCGCGG CATGAAAGTG GCCCAGACCG AAGCAGATCT GGTTCAAGCG
TTCCAGACCG CGCGCTCAGA AGCCAAAGCG GCCTTTGGCA ACGACGAAGT CTACATGGAG
AAATACCTCC AGCGTCCGCG CCACATCGAG ATCCAGGTCT TTGGTGATGG CAAAGGCGGC
GGCGTACACC TGGGCGAGCG CGACTGCTCC TTGCAGCGGC GCCACCAGAA GGTCTTTGAA
GAGGCTCCCG GCCCCTGCAT CACCGAGGAA GAGCGTGCCA AGATCGGCAA GATCTGCGCG
GATGCGATCG GCAAGATGGG CTACTCCGGT GCGGGTACCG TCGAATTCCT CTATGAGGAT
GGCGAGTTCT ACTTCATCGA GATGAACACC CGCCTGCAGG TGGAACACCC TGTGACCGAA
GCCATCTTTG GCGTCGACCT GGTGCGTGAA CAGATCCGCG TTGCCTCTGG CCTGCCGCTC
TCGTTCACGC AGGATGATCT GATCATCAAC GGTCACGCCA TCGAAGTGCG CATCAATGCC
GAAAAGCTGC CGAATTTCTC GCCCTGCCCG GGTAAGATCA CCGCATATCA TGCCCCCGGC
GGCCTTGGCG TGCGGATGGA TTCGGCGCTT TATGACGGCT ACTCGATCCC GCCGTACTAC
GACAGCCTGA TCGGCAAGCT GATCGTCCAC GGGCGCGACC GGGAAGAGGC CCTTGCGCGT
CTCAGCCGCT CGCTGGGCGA GCTCATTGTG GACGGGATCG ATACCACGGT GCCGCTGTTC
CACGCGCTCT TGCAGGAAAA AGACATCCAC ACCGGTGAGT ACAACATTCA CTGGCTGGAA
AAATGGCTTG AAGCCAACCT TCAGGCCTGA
 
Protein sequence
MFDKILIANR GEIALRVIRA CREMGIQSVA VHSTADADAM HVRMADESIC IGPPSGAQSY 
LSIPAIISAC EISGAQAIHP GYGFLSENAN FVQIVEDHGL TFIGPSAEHI RQMGDKITAK
DTAKALGIPC VPGSDGGVPD VETAKKVAAE MGYPVIIKAT AGGGGRGMKV AQTEADLVQA
FQTARSEAKA AFGNDEVYME KYLQRPRHIE IQVFGDGKGG GVHLGERDCS LQRRHQKVFE
EAPGPCITEE ERAKIGKICA DAIGKMGYSG AGTVEFLYED GEFYFIEMNT RLQVEHPVTE
AIFGVDLVRE QIRVASGLPL SFTQDDLIIN GHAIEVRINA EKLPNFSPCP GKITAYHAPG
GLGVRMDSAL YDGYSIPPYY DSLIGKLIVH GRDREEALAR LSRSLGELIV DGIDTTVPLF
HALLQEKDIH TGEYNIHWLE KWLEANLQA