Gene BBta_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4067 
SymbolispDF 
ID5153386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4261124 
End bp4262305 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID640558900 
Product2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase 
Protein accessionYP_001240039 
Protein GI148255454 
COG category[I] Lipid transport and metabolism 
COG ID[COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase 
TIGRFAM ID[TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
[TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.438057 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTT CTCAGCGGAC TGCAGCCATT CTCGTCGCGG CCGGACGCGG CCTGCGCGCC 
GGCACCGGCG GTCCGAAGCA ATATCGCGCG ATCGGCGGCC GCACCGTCAT CCACCGCGCG
CTCGCCGCCT TTTCGGAGCA TCCTGACGTC GCCGTCGTGC AGCCGGTGGT GAACCCCGAT
GACATCGACG TCTTCAATGC CGCGGTCAGT GGCCTGCGCC ACGAGGTGCC CGCGCATGGC
GGCGCGACAC GGCAGGCGTC AGTGCTCGCA GGCCTGGAAG CGCTGGTGCC GCACCGACCG
GACATCGTGC TGATCCACGA CGCCGCGCGC CCGTTCGTGA CATCGGCCGT GATCTCGCGC
GCGATCCAAG CGGCCGGCAA GACCGGCGCG GCCATTCCCG TCGTGCCCGT CACCGACACG
ATCAAGGAGG TCACGGCGAG CGGCGACATC ATCGCGACAC CGGAGCGCGC GAAGCTGCGC
ATCGCGCAGA CGCCGCAGAC CTTCAAATTC GAGGTCATCC TGGAGGCGCA TCGGCGCGCC
GCGCGCGACG GCCTCACCGA GTTCACAGAT GATGCGGCGA TCGCCGAATG GGCGGGATTG
ACCGTCGCGA CGTTTGAGGG CGATGTTGCC AATATGAAGC TCACCACACC CGAAGATTTC
GTGCGCGAGG AAGCGCGGCT CGCCGCTCAG CTCGGCGACA TCAGGACCGG CACCGGCTAC
GACGTGCATG CCTTCGGCGA GGGCGACCAT GTCTGGCTGT GCGGCCTGCG CGTGCCGCAT
AGCAAGGGCT TCCTGGCCCA CTCCGACGGC GACGTCGGAT TGCACGCCCT GGTTGACGCA
ATTTTGGGCG CCCTGGCCGA TGGTGACATC GGCTCGCATT TCCCGCCCTC GGACATGAAG
TGGAAGGGCG CCTCGTCCGA TCAGTTCCTG AAATACGCGA TCGAGCGGGT CACGGCGCGC
GGCGGACGGG TGGCCAATCT CGAGGTGACG ATGATCTGCG AACGGCCGAA GATCGGTCCC
CTGCGCGACC AGATGCGCGC ACGCATCGCC GAGATTTCGG GAGTCGATAT CTCGCGCATC
GCGGTGAAAG CCACCACCAG CGAGCGCCTC GGCTTCACCG GCCGCGAGGA AGGCATCGCC
GCGACCGCAA GTGCGACGAT CCGGCTGCCG TGGAGCGCAT GA
 
Protein sequence
MTISQRTAAI LVAAGRGLRA GTGGPKQYRA IGGRTVIHRA LAAFSEHPDV AVVQPVVNPD 
DIDVFNAAVS GLRHEVPAHG GATRQASVLA GLEALVPHRP DIVLIHDAAR PFVTSAVISR
AIQAAGKTGA AIPVVPVTDT IKEVTASGDI IATPERAKLR IAQTPQTFKF EVILEAHRRA
ARDGLTEFTD DAAIAEWAGL TVATFEGDVA NMKLTTPEDF VREEARLAAQ LGDIRTGTGY
DVHAFGEGDH VWLCGLRVPH SKGFLAHSDG DVGLHALVDA ILGALADGDI GSHFPPSDMK
WKGASSDQFL KYAIERVTAR GGRVANLEVT MICERPKIGP LRDQMRARIA EISGVDISRI
AVKATTSERL GFTGREEGIA ATASATIRLP WSA