Gene BBta_5998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_5998 
SymbolleuC 
ID5154237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp6212673 
End bp6214094 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content67% 
IMG OID640560721 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_001241843 
Protein GI148257258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.00498287 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGATGGATG TAGCGCGGCG GCAGCGAACG CTGTTCGACA AGGTCTGGGA TGCACATGTG 
GTGACGCGCC GCGAGGATGG CGCGGAGCTC TTGTTCATCG ATCGCCACCT CGTGCATGAG
GGATCCTTCC ACGCCTTCAA CAAGCTGAAG GAGAGGCGCG CGCAGGTGCG CCGGCCTGAT
CTCACGATCG GCGTCGCCGA TCACTACGTG CCGACGCGGA CCCGCGTGCT CAGCGAGATC
GCGCCGGAGA TCGCGGGCAT GATCCGCCAG CTCGACGACA ATTGCCGCGC CAATGATATC
CGTCTCTTCG GCTTCGACGA TCCGCGGCAG GGCATCGTCC ATGTGATCGG GCCCGAACAG
GGCCTCACTC TGCCTGGTCT CACGATGGTC TGCGGCGACA GTCACACCTC GACGCATGGG
GCGTTCGGCG CGCTCGCCTT CGGCATCGGC GCTTCGGAGG TTGCGCATGT GCTGCTGACG
CAATGCCTGT GGCAGAAGAG ACCGAAGCAG ATGCGCATCA CGATCGACGG CGCCCTTGCA
TCAGGGATCA CCGCCAAGGA TGTCGCGCTC GCGATCATCG CCAGGATCGG CGCCGATGGC
GCCCGCGGCC ATGCCATCGA ATATGCCGGA ACCGCCATCG ATGCGCTGTC GATGGAGGGA
CGGCTGACGC TGTGCAATCT CGCGATCGAG AGCGGCGCGC GTTGCGGGAT GATCGCGCCC
GACGAGACAA CCTTCGCCTA TGTGACGGGG CGGCCGTTCG CGCCCAAGGG CGATCTCCTC
GATCGCGCCA TCGCGAATTG GCGCGAGCTC GCGACCGACG CCGAGGCCGC GTTCGATCGC
GAGATCCGCC TCAATGGCCA GGAGATCGCG CCGACGGTCA CCTGGGGCAT CAGTCCGGAG
GACGCGCTGC CGATCAGCGC GGCCGTGCCT GATCCCGCTA TCTTCGACGA CCCCGCGCAA
GCGAGCCATG TGCGCGAGGC GCTCGACTAT ATGGGGCTTC AGGCCGGCCA GGCGCTCGAC
AGCATCAAGA TCGACCGCGT CTTCATCGGC TCCTGCACCA ACAGCCGCAT CGAGGATCTG
CGGGCGGCCG CCGCCATCCT CGCCGGCCGC ACCGCCCGGG TGCCAGGGCT GGTGTCGCCG
GGCTCGCACC TCGTCAAGCA GCAGGCCGAG CAGGAAGGCC TCGACCAGAT CTTCCGCGGC
GCGGGCCTCG ACTGGGTCGG CTCCGGCTGC TCGATGTGCG TCGGCATGAA TGGCGACCTC
GTGCCGGCCG GCGAGCGCTG CGCGTCGACC ACCAACCGCA ACTTCAAGGG CCGGCAAGGT
CAAGGCGCGC GCACGCATCT GATGTCGCCG GCGATGGTGG CGGCCGCAGC CGTGACCGGC
CAGCTGACCG ACGTGCGGAA CTTTCTGAGG GGCGATCGAT GA
 
Protein sequence
MMDVARRQRT LFDKVWDAHV VTRREDGAEL LFIDRHLVHE GSFHAFNKLK ERRAQVRRPD 
LTIGVADHYV PTRTRVLSEI APEIAGMIRQ LDDNCRANDI RLFGFDDPRQ GIVHVIGPEQ
GLTLPGLTMV CGDSHTSTHG AFGALAFGIG ASEVAHVLLT QCLWQKRPKQ MRITIDGALA
SGITAKDVAL AIIARIGADG ARGHAIEYAG TAIDALSMEG RLTLCNLAIE SGARCGMIAP
DETTFAYVTG RPFAPKGDLL DRAIANWREL ATDAEAAFDR EIRLNGQEIA PTVTWGISPE
DALPISAAVP DPAIFDDPAQ ASHVREALDY MGLQAGQALD SIKIDRVFIG SCTNSRIEDL
RAAAAILAGR TARVPGLVSP GSHLVKQQAE QEGLDQIFRG AGLDWVGSGC SMCVGMNGDL
VPAGERCAST TNRNFKGRQG QGARTHLMSP AMVAAAAVTG QLTDVRNFLR GDR