Gene TM1040_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1869 
Symbol 
ID4077894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1969271 
End bp1971301 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content62% 
IMG OID638007185 
Productcarbamoyl-phosphate synthase L chain, ATP-binding 
Protein accessionYP_613864 
Protein GI99081710 
COG category[I] Lipid transport and metabolism 
COG ID[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGAGA AGATCCTGAT CGCCAACCGG GGGGAAATTG CCTGTCGCGT CATCAAGACC 
GCGCGCAAGA TGGGCATCAA GACCGTCGCC ATTTATTCCG ACGCCGACCG TCAGGCGCTG
CATGTGCAGA TGGCGGATGA GGCCGTGCAT GTGGGCCCTG CGCCTGCCAA CCAATCCTAC
ATCGTCATCG ACAATGTGAT GGCCGCGATC AAATCCTCGG GCGCGCAGGC GGTGCATCCG
GGCTATGGCT TCCTGTCGGA AAACGCCAAA TTCGCCGAGG CGCTGGAGGC CGCAGGCGTC
GCCTTTGTTG GCCCGCCCAA AGGCGCGATT GAGGCGATGG GGGACAAGAT CACCTCGAAG
AAAATCGCCC AGGAAGCAGG CGTGTCGACC GTGCCCGGCT ACATGGGCCT GATCGCGGAC
GCCGATGAGG CGGTGAAGAT CTCCAACGAG ATCGGCTATC CGGTGATGAT CAAGGCCTCT
GCCGGGGGCG GCGGCAAGGG CATGCGGATT GCCTGGACCG ACGAAGAGGC CCGCGAGGGC
TTTCAGTCCT CCAAAAACGA GGCCGCGAAC TCCTTTGGCG ATGACCGGAT CTTCATTGAG
AAATTCGTGA CGCAACCGCG CCACATCGAA ATTCAGGTGC TCTGCGATGC CCATGGCAAC
GGCGTTTACT TGGGCGAGCG CGAATGCTCC ATCCAGCGCC GCAACCAGAA GGTCGTCGAA
GAGGCGCCGA GCCCCTTCCT CGATGAAGAG ACCCGCCGCG CCATGGGCGA GCAATCCGTC
GCGCTGGCCA AGGCCGTGGG CTATGCCTCT GCGGGCACCG TGGAATTCAT CGTCGACGGC
GACAAGAACT TCTACTTCCT CGAGATGAAC ACCCGCCTGC AGGTGGAACA TCCCGTGACC
GAACTCATCA CTGGTGTGGA CCTTGTGGAG CAGATGATCC GCGTGGCCGC CGGCAAGGAG
CTGTCGATCA CTCAGAATGA TGTCAAACTT ACCGGCTGGG CGATTGAAAA CCGCCTTTAT
GCCGAAGATC CCTATCGCAA CTTCCTGCCC TCCATCGGGC GTCTCACCCG CTATCGTCCC
CCGGCAGAAA CCGCGGCCTA CACGCCCGGC GTCGCGCCCG GAGATGCGGG CGATGTGGTC
GTGCGTAACG ACACCGGCGT CTATGAAGGC GGTGAGATTT CAATGTATTA CGACCCGATG
ATCGCCAAGC TCTGCACCTG GGCACCGACC CGTGATGCGG CGATCGAGGC GATGCGCGCG
GCGCTTGACA GTTTCGAGGT CGAAGGCATC GGTCACAACC TGCCGTTCCT TTCGGCGGTG
ATGGATCATC CGAAGTTTGT TTCGGGCGAG ATGACCACCG CCTTTATCGC CGAGGAATAC
CCCGAGGGGT TTGACGGCGT CGATCTGCCG GAAAGCGATC TGAAGCGCAT CGCGGCCTCT
TGTGCGGCCA TGCACCGGGT TGCCGAAATC CGCCGCACGC AGGTCTCGGG CCGCATGGAC
AACCACGAAC GCCGGGTGGG CAACACCTGG GTGGTGGCCA TTGGCGGGCA GACCTATGAG
CTGCGTGTTG CCGCCGATCC CGAAGGCGCA ACCGTGCGCT TTGAGGATCA AAGCGAGATC
CGCGTGAGTT CCGATTGGAC GCCGGGTGAC AGCCTTGCCC ATGTGGATGC GGATGGCACG
CCTCTGGTGC TGAAGGTCGA CAAGATCACC CAAGGCTTCC GCGTGCGCAG CCGGGGCGCG
GACCTCAAGG TGCATGTGCG CCGTCCGCGT CAGGCCGAAC TGGCCGCCTT GATGCCCGAA
AAACTGCCGC CCGATACCTC CAAGATGCTT CTGTGCCCAA TGCCCGGTCT TGTTGTGAAG
ATCAACGTCG AGGTGGGCGA AGAAGTGCAG GAGGGGCAGG CGCTCTGCAC CATCGAGGCG
ATGAAGATGG AAAACATCCT GCGCGCCGAG AAAAAATCCG TGGTCTCCAA AATCAATGCG
GCGGCAGGCG ACAGCCTCGC GGTGGACGAT GTGATCATCG AATTCGAATG A
 
Protein sequence
MFEKILIANR GEIACRVIKT ARKMGIKTVA IYSDADRQAL HVQMADEAVH VGPAPANQSY 
IVIDNVMAAI KSSGAQAVHP GYGFLSENAK FAEALEAAGV AFVGPPKGAI EAMGDKITSK
KIAQEAGVST VPGYMGLIAD ADEAVKISNE IGYPVMIKAS AGGGGKGMRI AWTDEEAREG
FQSSKNEAAN SFGDDRIFIE KFVTQPRHIE IQVLCDAHGN GVYLGERECS IQRRNQKVVE
EAPSPFLDEE TRRAMGEQSV ALAKAVGYAS AGTVEFIVDG DKNFYFLEMN TRLQVEHPVT
ELITGVDLVE QMIRVAAGKE LSITQNDVKL TGWAIENRLY AEDPYRNFLP SIGRLTRYRP
PAETAAYTPG VAPGDAGDVV VRNDTGVYEG GEISMYYDPM IAKLCTWAPT RDAAIEAMRA
ALDSFEVEGI GHNLPFLSAV MDHPKFVSGE MTTAFIAEEY PEGFDGVDLP ESDLKRIAAS
CAAMHRVAEI RRTQVSGRMD NHERRVGNTW VVAIGGQTYE LRVAADPEGA TVRFEDQSEI
RVSSDWTPGD SLAHVDADGT PLVLKVDKIT QGFRVRSRGA DLKVHVRRPR QAELAALMPE
KLPPDTSKML LCPMPGLVVK INVEVGEEVQ EGQALCTIEA MKMENILRAE KKSVVSKINA
AAGDSLAVDD VIIEFE