Gene Noc_2208 details

Gene Information

Locus tag: Noc_2208
Symbol:
ID: 3705146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2551657
End bp: 2552781
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 56%
IMG OID: 637738684
Product: 2-methylcitrate synthase/citrate synthase II
Protein accession: YP_344198
Protein GI: 77165673
COG category: [C] Energy production and conversion
COG ID: [COG0372] Citrate synthase
TIGRFAM ID: [TIGR01800] 2-methylcitrate synthase/citrate synthase II
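
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # The stop codon terminates translation and encodes no residue.
    return codons - 1 if has_stop_codon else codons


gene_length = span_length(2551657, 2552781)
print(gene_length)                           # 1125
print(expected_protein_length(gene_length))  # 374
```

With the values from this record, the span gives 1125 bp, which is 375 codons: 374 residues plus one stop codon.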


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.539974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (num): n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGGAG GAACCGCTGG TAGCGGCCTG GCCGGTGTTA CCGCTGGCAA AACGGCCATC 
AGTACCGTGG GGAAAGAAGG TAAGGGATTG ACCTATCGGG GTTACGCTAT TGAGATGCTG
GCCGAGAGGG CCAGCTTCGA GGAAGTGGCC TATTTGCTGA TTTATGGCCA ATCACCCAGC
CGTGCCCAGC TAACAGAGTA CCGGGAGCAG TTGCGATCCC TGCGCCGTTT ACCCGAGGGT
CTTAAACCCG TGCTGGAAGC CCTGCCGGGC AATAGCCATC CCATGGATGT AATGCGCACG
GGTTGTTCCG CCCTGGGTTG TCTCGAACCT GAAATAAACC GGGATCAGCA GTGGGATATC
GCTAATCGTT TGTTGGCTCT GTTTCCTTCC CTGCTGCTGT ATTGGTACCA GTTTCACCAT
CACGGTATCC GGATTGAGAC GGATACGGAG GAAGAATCCC TGGCGGGGCA TTTTTTAAAC
CTGCTCCATG GGGAATCTCC CGGGGCGATT CCCCGACGGG TGTTAGATGC CTCCCTCATT
CTTTACGCTG AGCATGAGTT CGCAGCTTCC ACTTTCGCCG CCCGGGTAAC CACCTCCACC
CTTTCCGATT TTTATTCCGC GGTTACCGCC GCCATTGGCA CCCTGCGGGG AACCTTGCAC
GGGGGCGCCA ATGAGGCGGC GATGGCGCTT ATCGAACGTT TCCAGGAGCC TGATGAGGCT
GAGCAGGGGG TTCTGGCGGC GTTGGAAAAC AAAGAAAAAA TCATGGGCTT TGGCCATCGG
GTGTACAAAG AAGCCGACCC CAGGAACGCC ATTATCAAGG ATTGGTGCCG GCAATTGGCG
GAACGGGCGG GGGACAGCTA TTTGTTTCCC ATTGCCGAGC GGATCGAGGT CGTCATGAAG
CGGGAAAAGG GCTTGTTTCC TAATTTGGAT TTTTATATCG CCCCGGCTTA TCGCTTTTTG
GGAATTCCCG TTCATCTTTA TACGCCCCTC TTTGTGTGTT CCCGGGTGGC GGGTTGGGCC
GCCCATATTA TGGAACAACG GGCCGATAAC CGCTTGATTC GTCCCATGGC CAACTACATT
GGTCCCGAGC CCCGGGATTT TGTGCCCATT GAGCAGCGTG GCTAA
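
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be flattened before any length or composition check. The sketch below shows one way to do that and to compute GC content; the helper names are hypothetical, and the record does not say how its own GC figure was calculated or rounded.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage with the first two blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAGTGGAG GAACCGCTGG")
print(len(fragment))                  # 20
print(gc_content("ATGAGTGGAG"))       # 50.0
```

Applied to the full sequence, `len(clean_sequence(...))` can be compared with the Gene Length field and `gc_content(...)` with the GC content field.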
 
Protein sequence
MSGGTAGSGL AGVTAGKTAI STVGKEGKGL TYRGYAIEML AERASFEEVA YLLIYGQSPS 
RAQLTEYREQ LRSLRRLPEG LKPVLEALPG NSHPMDVMRT GCSALGCLEP EINRDQQWDI
ANRLLALFPS LLLYWYQFHH HGIRIETDTE EESLAGHFLN LLHGESPGAI PRRVLDASLI
LYAEHEFAAS TFAARVTTST LSDFYSAVTA AIGTLRGTLH GGANEAAMAL IERFQEPDEA
EQGVLAALEN KEKIMGFGHR VYKEADPRNA IIKDWCRQLA ERAGDSYLFP IAERIEVVMK
REKGLFPNLD FYIAPAYRFL GIPVHLYTPL FVCSRVAGWA AHIMEQRADN RLIRPMANYI
GPEPRDFVPI EQRG
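
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the method used to produce this record. It builds the codon table from the standard compact amino-acid string; for internal codons, translation table 11 uses the same assignments as the standard code and differs in its set of permitted start codons. Function and variable names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the polypeptide
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGTGGAG GAACCGCTGG TAGCGGCCTG"))  # MSGGTAGSGL
# Final blocks of the gene sequence above, including the TAA stop codon.
print(translate("GAGCAGCGTG GCTAA"))                  # EQRG
```

Both fragments reproduce the corresponding ends of the protein sequence listed above. This sketch translates every codon literally, so an alternative start codon such as GTG or TTG, which table 11 permits, would need to be mapped to M separately.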