Gene Noc_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1953 
Symbol 
ID3704967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2237655 
End bp2239019 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content47% 
IMG OID637738429 
Producthypothetical protein 
Protein accessionYP_343945 
Protein GI77165420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.611108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGG AAAGGGAATC TATTTTACTC TCCGAAAGGC AGGCCGTATG CCAAGGATTA 
TTTTGGCTTT TAGGAATCTG CATCCTCGTG AGGGTGTTCC TGTGGATTGT CTATGCGCCG
GTGATATGGC CTGATACGGG CACTTACCTG CAGCTCGCCG GAGAGTTGGT TCGCCTGGAC
TTTAGCGAAT ATGATGGGAC CCGAACTCCC GTTTACCCGC TGCTGTTGCT TCTTGCGGGG
AAAAACCCTT TTGGTGTCTG GTTTCTTCAA TCCGTGCTTG GGGTTTCGGC CTCGCTTCTT
TTATTTGTAT TGGTATTTGA GCAGACACGG AAAGTGAGCC TTGCTTTCAG CGCGGGGACC
ATTCATTCCT TGGCCCTAAA TGAGCTGTTA TTTGAGGCAA ATCTTCTGAC GGAGTCGGTA
TCGTCCTTCC TGCTGATATT CTCAGTATTT ATTTTTTCAA AGCTGCTAGT AAGTGAACGG
AAATTTTGGC CGGCGGTTGG TCTTGGGGGC GCAAGCTCGC TTTTGGCGCT CACCCACCCC
CTCTTTGCTT TTGTTGGTCC TTTGTACATC TTGCTAGCGA TCTTGTTTTT TCGAGGGCGC
AATCGGATTT GTATCCCTCT CATCGTCTTG CTCTGTTTTG CTTTACCCGT ATCAGGGTGG
ATCGGCTTTA ACAAGATAAC TCTTGATTAT GCTGGAATCA CAACTTTTAT GGGGTATAAC
CTCAGCAATC ATAGCGGTGG TTTTATCGAG CGGGCGCCGC CGTCCAAGTT ACGAGATATC
TATCTCCGAT ACCGGGAGAA AAAAGTGAAG CAGAGCGGGA ATCACAGTAT GACAATCTTT
GAGGCAAAGG GTGAGATTAT GGCAGCAACG GGCCTAAGTC ACGTGGATCT CTCGCGGAAG
CTTGCCCGCC TCTCGCTCCA ACTTTTTGTG GAGCATCCTG CCTTATATTT GCAAAGCGTC
TTTAAATCAT GGGTTTCCTT CTGGGTTGCG CCTAATTACT GGAAACCCAC CAAAATAATA
TCCCCTACAG TGGCCGCCTC CCTTGAGCGC CTTTGGTGGG TTGAACAAGG GCTGATCAGG
GTTATGAATC TGTTATTTAT ATTTTTTGCT GCTGGCCTTA TTGTAAAGGT TGCCCTATTT
CAAGGTTATG CTGAACCTGC GTTACGAGTT CCGCTAGTTA TTACCGTACT CATCATTGTT
ACCTCCTTGG TGCAAGCATT ATTTGAATAC GGTAACACAC GCTATTCGAT TCCTACCCAA
TCACTGATGA TTACTTTTAT ATTGCTGTTT GGGCGGCAGA TAAAAGTGCA TCGTTGGTTA
GGGATAGTTT ACGTACGTTT GAAGTCTAAT TGGTTCCAGG CGTAA
 
Protein sequence
MIKERESILL SERQAVCQGL FWLLGICILV RVFLWIVYAP VIWPDTGTYL QLAGELVRLD 
FSEYDGTRTP VYPLLLLLAG KNPFGVWFLQ SVLGVSASLL LFVLVFEQTR KVSLAFSAGT
IHSLALNELL FEANLLTESV SSFLLIFSVF IFSKLLVSER KFWPAVGLGG ASSLLALTHP
LFAFVGPLYI LLAILFFRGR NRICIPLIVL LCFALPVSGW IGFNKITLDY AGITTFMGYN
LSNHSGGFIE RAPPSKLRDI YLRYREKKVK QSGNHSMTIF EAKGEIMAAT GLSHVDLSRK
LARLSLQLFV EHPALYLQSV FKSWVSFWVA PNYWKPTKII SPTVAASLER LWWVEQGLIR
VMNLLFIFFA AGLIVKVALF QGYAEPALRV PLVITVLIIV TSLVQALFEY GNTRYSIPTQ
SLMITFILLF GRQIKVHRWL GIVYVRLKSN WFQA