Gene Noc_0037 details


Gene Information

Locus tag: Noc_0037
Symbol:
ID: 3705970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 36273
End bp: 37454
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 50%
IMG OID: 637736561
Product: hypothetical protein
Protein accession: YP_342109
Protein GI: 77163584
COG category: [S] Function unknown
COG ID: [COG1565] Uncharacterized conserved protein
TIGRFAM ID:
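
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(36273, 37454)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1182 393"
```

Both results agree with the Gene Length and Protein Length fields of the record.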


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCCGCG TTCAAAAAGA TTTTTTTCCC CACCCTGAGC CGATAGCCTT GGCTCATAGC 
CAAAAATTGG AAAACGTGAT CCAAACGACC ATTGAGCAAG CAGGCGGCCA AATCCCCTTT
GCTCGCTTTA TGGAACTAGC GCTTTATACG CCGGGTTTAG GTTATTACAT GGCGGGTCTC
CATAAGCTAG GTACTTTCGG CGATTTTATT ACCGCTCCGG AGCTATCCCC TCTTTTTGCC
CGCTGCATCT CTCGCCAATG TCAGCAGATA TTTGAGCTGC TTGGAACAGG CGATATTTTA
GAATTTGGGG CTGGATCAGG CCGCCTAGCT GCGGATTTAC TTAGTGAACT AAACCTTAGC
GGTAATCTAC CGGAACGGTA TTTTATCTTG GAACTTAGTG CCGATTTGCG TCATCGCCAA
CAGGAAACAC TCTACCAGCG AGTACCCCTC CTCGCCTCAA GAGTAAGTTG GCTAGATCGA
CTACCCGACA GAATTGACGG CTTTATTCTA GCTAATGAGG TGTGCGATGC CATGCCTACG
CACTGCTTCC AGCTTGAAAA CGGGTACGAC TGGGAACGCT ACGTAGGCTA CGAGAAAGGC
AAGTTTGTCT GGAAAAAAGG CCCTTTAAGT CATCCCCTCC TGAAAGATCG CATTGCCAAA
ATACGCCTGC TTCTTAAACA TGTAAATAGC TACGAATCTG AAATTAATTT AGCTATGGAA
GGCTGGACTA CTGAAATCGC CCATCGATTG CGGAAGGGGA TGCTCCTCAT CATTGACTAT
GGCTTTCCTC GGCATGAGTA CTATCATCCA GAGCGAATGA TGGGCACTCT GATGTGCCAT
TATCGCCACC AGGCCCATCC CAATCCACTA ATCATGGCGG GGTTACAAGA TATCACTACC
CATGTGGATT TTACTGCTCT TGCCGAAGCA GGCCATAGTA GTGGGCTTAG GGTGGCCGGG
TATTGTACGC AAGCCGATTT CTTGCTGGCC TGCGGTTTGG ATAAACTAGC TGCGACCGAA
ATCGCAGCAG GGGAGAAGCA GGCTTTGGAA ACCAGCCAAC AGATCAAGCG CCTTGTTCTC
CCCAGCGAGA TGGGTGAACT TTTTAAGGCC CTCGCCCTAA CCCGGGAAAT TAACCAGCCC
CTATTAGGTT TTAATTTGCG GGATCGGCGG GCCCGCCTAT AA
 
Protein sequence
MRRVQKDFFP HPEPIALAHS QKLENVIQTT IEQAGGQIPF ARFMELALYT PGLGYYMAGL 
HKLGTFGDFI TAPELSPLFA RCISRQCQQI FELLGTGDIL EFGAGSGRLA ADLLSELNLS
GNLPERYFIL ELSADLRHRQ QETLYQRVPL LASRVSWLDR LPDRIDGFIL ANEVCDAMPT
HCFQLENGYD WERYVGYEKG KFVWKKGPLS HPLLKDRIAK IRLLLKHVNS YESEINLAME
GWTTEIAHRL RKGMLLIIDY GFPRHEYYHP ERMMGTLMCH YRHQAHPNPL IMAGLQDITT
HVDFTALAEA GHSSGLRVAG YCTQADFLLA CGLDKLAATE IAAGEKQALE TSQQIKRLVL
PSEMGELFKA LALTREINQP LLGFNLRDRR ARL