Gene Noc_1958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1958 
Symbol 
ID3704972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2244251 
End bp2245510 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content53% 
IMG OID637738434 
Producthypothetical protein 
Protein accessionYP_343950 
Protein GI77165425 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.441917 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAAGC GAGTATTGAT GATCGCTTAT CATTTTCCGC CCGCCAGCGG CAGCAGCGGT 
ATGCAGCGCA CTCTCTCTTT CTCCCGCGAT CTTCCCGAAC ATGATTGGCA GCCCATCATT
CTGAGCATCT ACCCTCGTGC CTATGAACGG CGCTGTGACG ACCAATTGGC TGACATTGGC
TCTGAAACTA TCGTGTATCG TGCTTTTGGG CTTGATACTG CGCGCCATTT GGCATTGGGA
GGTCGCTATA GTCGATTCCT GGCGCTTCCG GATCGCTGGG TAAGTTGGTG GCTCGGAGCT
GTGCCCGCGG GACTGAGGCT TATTCGTCGT TATCGTCCCC AGGTTCTTTG GTCTACTTAT
CCGATTGCTA CGGCCCATCT CATTGGTTTA ACTTTACATC GGCTGAGCGG GATTCCTTGG
ATTGCCGATT TCCGAGATTC CATGACGGAG GATAATTATC CGTCTAATCC ACGGGTACGA
CGCGCTTATC GTGCGGTTGA AGCGGCTACA GTGCACCGCT GCACGCGAGC GATATTCACC
GCGCCTGGGG CGGTGCGTAT GTACGCCGAG CGTTACCCAG AGCGTTCCGA TAAAACATGG
GTTCTTATTG AGAATGGTTA CGAAGATTCT ATTTTTGATA CGGTTTCTTT GCCTTCATTG
AGGGATTCTC CGAGGCCGTT TCGGCTTGTA CACAGTGGGG TAGTGTATCC TAACGAACGG
GATCCCCGTG CATTTTTCGA GGCACTGGCA AGCCTGAAGC GATCAGGCCA GATCACCGCT
CAAAGCCTAC AGGTGGTGTT CCGGGCCAGT GGTTCGGAGG ATTACTTCAG GCAGCTTTTG
CGCGAATGGG GCATTGATGA CATTGTGCAC TTTGAACCCC ATATCCCTTA TCGTGGGGCA
CTTGCTGAGA TGCTTACAGC TGATGGACTT TTGATTTTAC AGGCAAGCAA TTGCAACCAT
CAAATTCCAG CAAAGCTTTA TGAATACCTG CGCGCACGGC GGCCGATTTT GGGGCTTACG
GATTCTGAGG GAGACACTGC GAGGGTGTTG CGGCAGGCGG GTATTGAAAC CGTTGCCCCC
CTTGATTCGG CTGCGGCTAT CACGGCAACA CTGCAAGATT TCCTTAAGCA ACTTCAGGAT
GGCACAGCCC CCGTGGCAAG CGAAGCAGAG ATCGCTCGTA GCTCAAGGCG TAGCCGTGTA
GCATCCCTAG CTGAATGCTT GGAGGAGACG ATTGCCACGG ATCTCAACTT TGAGAGGTAA
 
Protein sequence
MVKRVLMIAY HFPPASGSSG MQRTLSFSRD LPEHDWQPII LSIYPRAYER RCDDQLADIG 
SETIVYRAFG LDTARHLALG GRYSRFLALP DRWVSWWLGA VPAGLRLIRR YRPQVLWSTY
PIATAHLIGL TLHRLSGIPW IADFRDSMTE DNYPSNPRVR RAYRAVEAAT VHRCTRAIFT
APGAVRMYAE RYPERSDKTW VLIENGYEDS IFDTVSLPSL RDSPRPFRLV HSGVVYPNER
DPRAFFEALA SLKRSGQITA QSLQVVFRAS GSEDYFRQLL REWGIDDIVH FEPHIPYRGA
LAEMLTADGL LILQASNCNH QIPAKLYEYL RARRPILGLT DSEGDTARVL RQAGIETVAP
LDSAAAITAT LQDFLKQLQD GTAPVASEAE IARSSRRSRV ASLAECLEET IATDLNFER