Gene Noc_2207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2207 
SymbolprpD 
ID3705145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2550193 
End bp2551644 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content58% 
IMG OID637738683 
Product2-methylcitrate dehydratase 
Protein accessionYP_344197 
Protein GI77165672 
COG category[R] General function prediction only 
COG ID[COG2079] Uncharacterized protein involved in propionate catabolism 
TIGRFAM ID[TIGR02330] 2-methylcitrate dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.50931 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATG ACAGACGTTC AGCCCAGCGG CCGCCGCCGG ATGAGGTTCT GACGATCTTA 
GCGGATTATG TTCTCCAGGG GGAAGTTAAC CGTCCCGAAG CCTATGAGAC TGCCCATGAC
TGCTTAATGG ACAGCCTTGG CTGTGCTTTG CTCGCCCTGG ATAATCCTGC CTGTACCCGC
CTGCTTGGCC CGATTGTCCC TGGCGTGATT TTCCCCGCGG GCGCCCGGGT GCCGGGAACC
GGCTATGTCC TTGATCCGGT GCAGGCCGCT TTCAATATCG GCATCATGAT CCGCTGGCTG
GATTTCAATG ACACTTGGCT GGCCGCTGAA TGGGGCCATC CCTCCGATAA TCTGGGGGCT
ATTCTAGCCA TTGCCGATTA TTTGAGCCGC CAGCGGCGGC AAGAAGGCGC GCCGCCGCTG
GCGATGCGCC AGGTGCTGAC CGCATTGATC AAAGTCTATG AAATTCAGGG CGTGTTGGCC
TTGGAGAATG CCTTTAATCG GGTGGGATTG GATCATGTGG TGCTGGTACG GGTGGCTAGC
GCCGCCGTGA CAGCAGCTTT ATTAGGCGGT ACTCAGGAGC AGATTATCAA CGCCCTTTCC
AATGCCTGGC TGGACGGGGG GCCCCTGCGG ACCTACCGCC ATGCCCCCAA TACCGGTTCA
CGGAAAAGCT GGGCCGCGGG GGATGCTACC AGCCGGGGGG TGCGTTTAGC TCTTATGGCC
TTGCAAGGGG AGATGGGCTA CCCGTCGGCT TTGACGGCCC CTGGCTGGGG GTTTTACCAG
GTGTTATTCA AAGGAGAGTC CTTTACCTTG CCTCGGGCTT TAGGCAGTTA TGTCGTGGAG
AATATCCTCT TCAAAGTAGC CTATCCGGCG GAGTTCCATG CCCAGACCGC CATTGAGGCG
GCGATTTCAC TCCATCCCCA GGTGACGTCC CGATTGTCGG AGGTGGCACG GATTGTCATT
GAGACCCAGG AACCGGCGGT GCGAATCATC GACAAGACTG GTCCCCTCCA TAATCCTGCG
GATCGGGATC ACTGCCTGCA GTACATGGTG GCGGTAGCCT TGCTGGAGAG CCAGATCACC
ATGAAGGATT ACGAAGATGA ACGGGCCCGG GACCCCCGGA TTGACGCCCT GCGGGAGAAG
ATGGAGGTCA TCGAAAAGAA GGAATTTACC GAGGATTATT TAGATCCCGA GAAACGGGCG
ATTGCCAATG CGGTCCAGGT ATTTTTCAGC GATGGCAGCG CTACCCTCCG GGTGGAGGTG
ACCTATCCTT TAGGCCACCG GCGGCGGCGC GCCGAAGCCT TGCCCCTGCT ACGGGATAAA
TTCCAGAACA GCCTGGGCGG CTGTTTTCCT CCGGAACGTT GCCAGACAAT TTTGGATCTC
TTTAGCGACC GGGAGCGTTT GGCGGCCATG CCCGTGGATG AGTTCATGGA ACTGTTTATT
AGCACGGCCT GA
 
Protein sequence
MSNDRRSAQR PPPDEVLTIL ADYVLQGEVN RPEAYETAHD CLMDSLGCAL LALDNPACTR 
LLGPIVPGVI FPAGARVPGT GYVLDPVQAA FNIGIMIRWL DFNDTWLAAE WGHPSDNLGA
ILAIADYLSR QRRQEGAPPL AMRQVLTALI KVYEIQGVLA LENAFNRVGL DHVVLVRVAS
AAVTAALLGG TQEQIINALS NAWLDGGPLR TYRHAPNTGS RKSWAAGDAT SRGVRLALMA
LQGEMGYPSA LTAPGWGFYQ VLFKGESFTL PRALGSYVVE NILFKVAYPA EFHAQTAIEA
AISLHPQVTS RLSEVARIVI ETQEPAVRII DKTGPLHNPA DRDHCLQYMV AVALLESQIT
MKDYEDERAR DPRIDALREK MEVIEKKEFT EDYLDPEKRA IANAVQVFFS DGSATLRVEV
TYPLGHRRRR AEALPLLRDK FQNSLGGCFP PERCQTILDL FSDRERLAAM PVDEFMELFI
STA