Gene Noc_0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0210 
Symbol 
ID3706265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp233680 
End bp235110 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content54% 
IMG OID637736726 
Producthypothetical protein 
Protein accessionYP_342270 
Protein GI77163745 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATCGTC TTTGTCAAAA AAACTTTGGC GGAGCAGGCG AGCTGCTTCG GGAGGCGGCG 
GAAGGAAAGG CGAAACAGCT CGCTAAAGTA CGCAAGGAAT TAGAACAGCT CACAGAAGAA
ACTGTGAATA GTTTTCGACT TGCGGGCGAT GCCCACAGCA ATAATTATGC TTTTGACGAA
GCTTTAATTG CTTATGAACA AGCACTGGCA TGTGTGCTGA GAGAGATAAC TCCTCATTTA
TGGGCGGTTA TCATGGTTCA GGTGGGGAGA GCCTGCCATG AATTGGGCGT TCGGGCTGAA
GGAGCGGCGT TGCATTATCA TTTATCGGCT GCGGTAGAGG CTTACCGGCG GGCTTTAAAG
GTACAAACCC GGCGGTATTT ACCCCAGGAT TGGGCCCGAA CCCAGGCGTA TCTGGGGACT
ACCTTACGGG AGCAGGGAGT CCGGATGGGA GAAGAGGCCG GAGGCCGGTT GCTGGAGCAG
TCGGTAGAGG CTTACCGGCG GGCCTTAAAG GTACAAACCC GGTGGCATTT ACCCCAAGAC
TGGGCCCGAA CCCAGGCCCA TTTGGGGACC ACTTTGCGGG AGCAGGGAGT CCGGATGGGA
GGAGAGGCCG GAGGCCGGTT GCTGGAGCAG TCGGTAGAGG CTTACCGGCA GGCGCTGGAG
GTGCAAACCC GGCGAGATTT TCCCCAAGAT TGGGCTTGGA CCCAGAGCCA CTTGGGGATT
GCTTTGCGGG AACAGGGCAT GCAGGCAGGA GGAGAAGCCG GGCGGCAGCT ATTGGGGAAA
GCGATAGCAG CTTATCAAGG GGCGTTGGAG ATTCATACGC CTGAAACGCT TCCTTGGCAT
TGGAATCAAA CTCAACATCA TCTAATTCAA ACCTGGCTTG CGCTCGAAGA CTGGCCTGCA
GCGGCCACGG GTTTTGTCCG CTTACTAGAA ATCTATCCTG ATGATGCAGA AGCCTATTAC
GGAGCTAGCA TGCTTTACCA TGAGAAGCTG TTTGCCTTTG AAGAGGCTTT TTCCCTTAGC
CGGCGATGGT TCGCTTCTCG CCCCGAAGAT CTAGAAGCCC GCGGCCAGTT GGCGGAGCAG
TGTTTCACTA CTGGACGTTT TGCCGAGGCT ATTGAACATT TTGCAGAGTT GCTGGCCAAC
CCTGAGATTA ATCCTCAAAT AAAAATCCCT TTGCAGGCTT TTGAGATTGC AGCGCTGCTG
GGTTTAAACC AAAAGGTGGC CGTTCCAGAA AAATTCGAGC AATTATGTAT GGCCATTGCT
CACCAGTCTG GGAACTTTGT ACTGGAGTGG GCCTTTGCGG GGAGCAAATA TTTTATTGCT
CGAAATGAGC GTCTCATGCC TTACCGTGAA TGGCTGCTAG CGTTATTCAT CGCGCTAGAA
GCGCCAGGCC GAGACGCTAT TTTAAACCGC CTGGGAGAGG ATATACGGTA G
 
Protein sequence
MHRLCQKNFG GAGELLREAA EGKAKQLAKV RKELEQLTEE TVNSFRLAGD AHSNNYAFDE 
ALIAYEQALA CVLREITPHL WAVIMVQVGR ACHELGVRAE GAALHYHLSA AVEAYRRALK
VQTRRYLPQD WARTQAYLGT TLREQGVRMG EEAGGRLLEQ SVEAYRRALK VQTRWHLPQD
WARTQAHLGT TLREQGVRMG GEAGGRLLEQ SVEAYRQALE VQTRRDFPQD WAWTQSHLGI
ALREQGMQAG GEAGRQLLGK AIAAYQGALE IHTPETLPWH WNQTQHHLIQ TWLALEDWPA
AATGFVRLLE IYPDDAEAYY GASMLYHEKL FAFEEAFSLS RRWFASRPED LEARGQLAEQ
CFTTGRFAEA IEHFAELLAN PEINPQIKIP LQAFEIAALL GLNQKVAVPE KFEQLCMAIA
HQSGNFVLEW AFAGSKYFIA RNERLMPYRE WLLALFIALE APGRDAILNR LGEDIR