Gene TM1040_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3024 
Symbol 
ID4076597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3192426 
End bp3193610 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content59% 
IMG OID638008353 
Product2-amino-3-ketobutyrate coenzyme A ligase 
Protein accessionYP_615018 
Protein GI99082864 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes 
TIGRFAM ID[TIGR00858] 8-amino-7-oxononanoate synthase
[TIGR01822] 2-amino-3-ketobutyrate coenzyme A ligase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.299571 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.986535 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAATG CTTTTCTCAG CCACATCAGC GAAACCCTGA CGCAGATCGA AGCCGATGGT 
CTCTACAAGC GCGAGCGAAT GATCACCTCG CCTCAGGGCG GCGAGATCCG GGTTGGGGAC
CGCGAAGTTA TCAATCTTTG TGCCAATAAC TATCTTGGGC TGGCCGACCA TCCCGATTTG
ATCGCCGCCG CAAAATCCGT GATGGACGAA AAAGGTTATG GTATGGCCTC TGTGCGCTTC
ATCTGCGGGA CACAGGATCT GCATCGAACG CTGGAGCAGA AGATAGCAAA TTTTCTCGGC
AAGGACGATT CGATCCTCTT TGCAGCCTGT TTTGACGCCA ATGGCGGGTT GTTCGAGCCG
TTGCTCGGCC CTGAAGATGC GATCATTTCC GACAGCCTGA ACCACGCCTC GATCATCGAC
GGCATCCGTC TTTGCAAGGC ACAGCGCTAT CGCTATGCCA ATAATGACAT GGAGGATCTC
GAGGCGAAGT TGAAGGACGC GCGTGCCAAG GGCGTGCGCC ACATCATGAT CGCAACCGAT
GGGGTTTTCT CCATGGATGG CTACCTCGCC AATCTGCCTG CGATCCGGGA GATTGCCGAT
CGGCATGACG CGATGGTGAT GGTAGATGAC TGTCATGCAA CCGGCTTCAT GGGGCCAAAA
GGGGCAGGTA CGCCGGATCA CTTCGGCGTG GACGTCGATA TCCTGACCGG CACGCTGGGC
AAGGCGCTGG GCGGTGCGAT TGGAGGCTAC ATCGCCGGCC CCCAGCCCGT GATCGATCTG
CTGCGCCAAC GGGCGCGCCC CTATTTGTTC TCAAACTCCC TACCGCCCGC GGTGGTGGCT
GCCGGGCTGG AGGCGATCCG CCTGGTCGAG GAAGGCGAGA GCCTGCGCCG ACAACTGTTT
GAAAACGCCA CGATCTGGCG CGAAGGGCTA ACCCGTCTGG GTTTTGACCT GCTGCCCGGA
GAGCACCCGA TCATCCCGGT GATGCTGGGC GATGCCAAGC TGGCACAGGA AATGGCCAAT
AAATTGTTTG AAGAGGGCGT CTATGTCTCC GGCTTTTTCT TTCCCGTTGT GCCAAAAGGA
CAAGCCCGCA TCCGCACCCA GATGAACGCC GCCCTGACCC AAGACGAGTT GAACCGAGCC
CTGAACGCGT TTGAGCGTGC GGGCAAGGCC TGTGGAGTGA TCTGA
 
Protein sequence
MSNAFLSHIS ETLTQIEADG LYKRERMITS PQGGEIRVGD REVINLCANN YLGLADHPDL 
IAAAKSVMDE KGYGMASVRF ICGTQDLHRT LEQKIANFLG KDDSILFAAC FDANGGLFEP
LLGPEDAIIS DSLNHASIID GIRLCKAQRY RYANNDMEDL EAKLKDARAK GVRHIMIATD
GVFSMDGYLA NLPAIREIAD RHDAMVMVDD CHATGFMGPK GAGTPDHFGV DVDILTGTLG
KALGGAIGGY IAGPQPVIDL LRQRARPYLF SNSLPPAVVA AGLEAIRLVE EGESLRRQLF
ENATIWREGL TRLGFDLLPG EHPIIPVMLG DAKLAQEMAN KLFEEGVYVS GFFFPVVPKG
QARIRTQMNA ALTQDELNRA LNAFERAGKA CGVI