Gene Noc_2201 details

Gene Information

Locus tag: Noc_2201
Symbol:
ID: 3705139
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2544640
End bp: 2546529
Gene Length: 1890 bp
Protein Length: 629 aa
Translation table: 11
GC content: 55%
IMG OID: 637738677
Product: hypothetical protein
Protein accession: YP_344191
Protein GI: 77165666
COG category:
COG ID:
TIGRFAM ID:
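
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2544640
END_BP = 2546529
GENE_LENGTH_BP = 1890
PROTEIN_LENGTH_AA = 629


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)     # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions the 1890 bp span corresponds to 630 codons, one of which is the stop codon, which agrees with the reported 629 aa.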


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTCGCC ACCTGGGTTT ATTTTGTCTT CTTTATGGTA TGTGCGGCTG GGCCCAGGCG 
GAGATTCCCA CCCTGGAGGA ACTGCTGGCG GCAGCCCAGC AACAGCAGCT TTCGGAACGC
CCCGCGTGGT GGGCCTTGCT TCATTACCGG CCGACCGTGC TGGGATGGAC TCACAAAAGC
CAAGCGGACG ATCCCCGTTT TTTTCTTGCC AAAAACGGGA AAACCGATCC CCATGGGGAG
CTTACGGCAA CGCTCATAGC GTTGCGCAAT GGGGATGGCC AGGGGACTCA TCCCCAATGC
CGCTTTCCTG CTCGCTTTTA TTGGCTACAA CAACAGGGCC ATCCTTTCTC CCGTATTCCG
GCGGTGCCAT GCCCACAGCT TGCAAGTTGG TTGGAGCAAT TGAATACGGC CCGGGTTACC
CTGGTATTTG CCTCCTCCTA TCTCAATAGC CCATCCTCCA TGTTTGGCCA TACCTTTTTG
AGGTTAGATC CGCCCAGCTT CAATCAGGAC AGCCTGATTT TAGCCAGCAC GGTCAGCTAT
GCCGCAGATG CCGCGGCCCA TGACAGCGAA ATCATGTTTG CTTACCGGGG TATTTTTGGG
GGTTATCCGG GAATCACCAC GGTAGAGCCT TATTTCGACA AGCTTCGCCT CTATAGCAAA
CTGGAAAACC GGGATCTGTG GGAGTACGCG TTGAATCTGC GTCCTGATGA GGTGACCCAG
TTACTTCGCC ATGTCTGGGA AATAAAAGAT AAGCGGTTTG ATTATTTTTT CTTTGACGAA
AACTGTGCCT ATCGTCTGCT GGCCCTGATT GATGTAGCCC GTCCTGGCAC CAGGCTGATT
CCCCAAGTGA GCATTTTGCG GGCTATTCCT TCCGATACGG TGCGTATTGT GGTCGCCAAT
CATTTGGTCG ATTCAGTTTA TTTTCGCCCC TCCCAGGCGA CCGAGTTACG CCACCATTTA
AGCCAACTCA AGGCTTCCGA GCGCCACTGG GTATTTTCAA TGAATGAAAG CCAACAGCTT
CCTTCTGAGA AAGCCACTGC CGGCTTAGCG CCGGAGCGGT ATGCGGCGGT TCTGGAAACA
GCCTATGAGC TGGTGCGCTA TCGGGTCCAA AAGGAGCAAT TATCCCGCGA GGCTACCGCC
GATTTGTCCT ATGGATTGCT GCAAGCGCGC AGCAAGATAC CGGGATCATC CCCTTTATCG
GCTGCGCCCC GGCCCAAGGT GCGGGATGAT CAAGGCCATC CGACCGGCCG CATAACGGCT
ATGGCGGGCC GCCAAGGGCA ACGTCCTTAT CTGGGGTTAT CGCTGCGGCC CGCCTACCAT
GATTTTTCGG ACCCAGCGCC GGGTTACCGG CTTGGCTCCC AGCTTCAGTT TCTAAATGCG
GAGCTGCGTT ATTATTGGGA GAGCAATCAG TTTGAGCTTG AGCAGCTAAA TCTCGTGGAT
ATTATTTCCT TGTCTGCCCG GGATCGCTTC TTCCGTCCCA TTTCATGGCA AGCGGGTTTT
GGCGCGGATC GCCAGTTGAC CAGTGAAGGG ACGCGCCCCT TGGCCGCTTA TCTTAAAGGC
GGTGCGGGGG TGAGTTACGG TATTGCCCAT GGATTGGCCT ATGCTTTAGC TACCGGTACC
GGCCGGATAG GCACTCAGCT CCAGGATAGT TATCAGCTAT CGCCGGGCAT CGCCCTGGGC
TGGAATTTAC AACGGTACTG GGGAAGTTTT CGGGTCGAAG CTCAGAGTCA ATTTTTCCCA
GGCCAGAATG AAGATCCCTA TTATCGGGGT AGCGCCAGTC TGAATTTATA TTGGCCGCCC
GGCTGGGGGC TGACTTTCAA TATAGGACGG GAGAAGAGCG GGGATTTTTA TGCCATTATC
CTCCAAGGCG GCGTTCGGCT TTATTTTTAA
 
Protein sequence
MLRHLGLFCL LYGMCGWAQA EIPTLEELLA AAQQQQLSER PAWWALLHYR PTVLGWTHKS 
QADDPRFFLA KNGKTDPHGE LTATLIALRN GDGQGTHPQC RFPARFYWLQ QQGHPFSRIP
AVPCPQLASW LEQLNTARVT LVFASSYLNS PSSMFGHTFL RLDPPSFNQD SLILASTVSY
AADAAAHDSE IMFAYRGIFG GYPGITTVEP YFDKLRLYSK LENRDLWEYA LNLRPDEVTQ
LLRHVWEIKD KRFDYFFFDE NCAYRLLALI DVARPGTRLI PQVSILRAIP SDTVRIVVAN
HLVDSVYFRP SQATELRHHL SQLKASERHW VFSMNESQQL PSEKATAGLA PERYAAVLET
AYELVRYRVQ KEQLSREATA DLSYGLLQAR SKIPGSSPLS AAPRPKVRDD QGHPTGRITA
MAGRQGQRPY LGLSLRPAYH DFSDPAPGYR LGSQLQFLNA ELRYYWESNQ FELEQLNLVD
IISLSARDRF FRPISWQAGF GADRQLTSEG TRPLAAYLKG GAGVSYGIAH GLAYALATGT
GRIGTQLQDS YQLSPGIALG WNLQRYWGSF RVEAQSQFFP GQNEDPYYRG SASLNLYWPP
GWGLTFNIGR EKSGDFYAII LQGGVRLYF
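
The gene and protein sequences above can be checked against each other by translating the nucleotide blocks. The sketch below is one way to do that in plain Python, under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 is understood to share (table 11 differs mainly in the start codons it permits), and the helper names `clean`, `translate`, and `gc_fraction` are illustrative. Only the first and last lines of the gene sequence are used as samples.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block):
    """Join a sequence printed in space-separated groups into one string."""
    return "".join(block.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCTTCGCC ACCTGGGTTT ATTTTGTCTT CTTTATGGTA TGTGCGGCTG GGCCCAGGCG"
last_line = "CTCCAAGGCG GCGTTCGGCT TTATTTTTAA"

print(translate(first_line))  # MLRHLGLFCLLYGMCGWAQA
print(translate(last_line))   # LQGGVRLYF
```

The two translated fragments match the start (`MLRHLGLFCL LYGMCGWAQA`) and end (`LQGGVRLYF`) of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.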