Gene Cmaq_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1167 
Symbol 
ID5709430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1224914 
End bp1226161 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content50% 
IMG OID641275668 
Product3-isopropylmalate dehydratase large subunit 
Protein accessionYP_001540984 
Protein GI159041732 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR01343] homoaconitate hydratase family protein
[TIGR02086] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTCA CGTTAACCGA GAAGATACTG AGCAAGGCTG CGGGAAGGCA GGTTTCCCCA 
GGTGACGTAA CGGAGATAAC AGTTGACTTA GCGGCATTCC ATGACCTAAC CGGCCACCAC
GTAGTGGAGG TTATGGAGAA TATTGGTGCC GTTAAGGTTT GGGACCTTGA TAGGTTCGTA
ATAGCGTTCG ACCACTTGGC GCCGCCGCCT AATGATAGGG CTGCTGAGAT TCAGGTTAAG
TTAAGGAAGT TCGCTAAATC CATTAACGTA AGGAACTTCC ATGATGTTGG AGACGGCATC
CTACATCAAC TACTCCTAGA GAAGTACGCG TTACCTGGTC AAGTAGTGAT GGCTGCGGAC
AGCCACACAA CAACAGTTGG CGCAGTGGGG GCATTCGCCC AGGGAATGGG GGCCAGTGAC
ATGGCTGCAA TACTCATGAC AGGTAAGACT TGGTTAATGA TCCCTGAACC ATTCCTAATA
AGGCTAATTA ATGAACCAGC CCCAGGCGTA TACGGTAAGG ATGTGGCGCT TCACATACTC
TCAGTGTTTA AGGCGGAGGG CTTAAACGGT AAGTCTGTTG AATTACAGGT TGAGAAGCCT
AAGGCATTCC CAATGGATTA TAGGGCTACG GTATCAAACA TGGGTGTTGA GTTCGGCGCC
GATGCAGCAA TATTTATTCC AGATGAGGAG ACTGTAAGCT ACCTGAGTAG AAGTAGGGGT
ATTAATGTTA AGCCAATTAC CCCTGATCCA GACGCCAAGT ACGTTGATGA GTACACCATT
GAGTTAAATA AGCTTGAACC ACTTGTGGCT GCACCGCATA GTGTTGATAA CGTTAAGCCT
GTTAGTGAGG TGGAGGGCAT TGAGGTTGAC TACGTCTTCA TAGGGTCATG CACCAACGGT
AGGTTAAGTG ACCTTGAGGC TGCAGCCAGG ATACTTAAGA ATGGTAAGGT TAAGGCTAGG
TGCATTGTCA TCCCAGCATC AAGGGACCTA TTCACTAAGG CCCTTGACGC CGGTTACGTA
GAGACGTTAA CTAAGGCTGG TTGTGTGGTT ACGTACGGTA CATGTGGACC ATGCCTCGGA
GGCCACTTCG GTGTAATTGG GCCAGGGGAG ACTGCTGTGT CTACTGGTAG TAGGAATTTT
AAGGGTAGGA TGGGGTCACC TGAGGGTAAG GTTTACCTAG CCAATGCAGC CACAGCGGCG
GCAACTGCGC TTGAGGGTAG GTTAACGGAC CCGAGGAAGT ACCTTTAA
 
Protein sequence
MGLTLTEKIL SKAAGRQVSP GDVTEITVDL AAFHDLTGHH VVEVMENIGA VKVWDLDRFV 
IAFDHLAPPP NDRAAEIQVK LRKFAKSINV RNFHDVGDGI LHQLLLEKYA LPGQVVMAAD
SHTTTVGAVG AFAQGMGASD MAAILMTGKT WLMIPEPFLI RLINEPAPGV YGKDVALHIL
SVFKAEGLNG KSVELQVEKP KAFPMDYRAT VSNMGVEFGA DAAIFIPDEE TVSYLSRSRG
INVKPITPDP DAKYVDEYTI ELNKLEPLVA APHSVDNVKP VSEVEGIEVD YVFIGSCTNG
RLSDLEAAAR ILKNGKVKAR CIVIPASRDL FTKALDAGYV ETLTKAGCVV TYGTCGPCLG
GHFGVIGPGE TAVSTGSRNF KGRMGSPEGK VYLANAATAA ATALEGRLTD PRKYL