Gene Noc_1610 details


Gene Information

Locus tag: Noc_1610
Symbol:
ID: 3705733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1799590
End bp: 1801029
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 53%
IMG OID: 637738086
Product: hypothetical protein
Protein accession: YP_343615
Protein GI: 77165090
COG category:
COG ID:
TIGRFAM ID:
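
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length. The variable names are made up for this example and do not come from the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1799590
end_bp = 1801029
gene_length_bp = 1440
protein_length_aa = 479

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no residue.
codon_count = span_bp // 3
expected_protein_length = codon_count - 1

print("span (bp):", span_bp)
print("codons:", codon_count)
print("expected protein length (aa):", expected_protein_length)
```

Under these assumptions the span matches the reported gene length, the length is a multiple of three, and the codon count minus the stop codon matches the reported protein length.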


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGCTTA AAAATCGTAT TTACCGGCCA TGGGCCACAG TCGGTTTGGT TTTTTTATTG 
ACTGCCTGCG GATCAGGTGA GGAATCTAAA CCCGAGTCAT CCTCGACCAT TAAAACGGCT
TCTACCGGCT TTGTTCCGCC TGAACCCCAA CGGCAGGGCG ATCCGGAAGC GGGCTACGAT
GCCATCATCA ATAAGGCTTA TGTGACCTGT GGCATTCCCT ATGATGCTTA TAAGAAAAGC
GATCTGGACC GCCCTCCCGA ATACGCACTC CCTGGACGAG AGGAGCCTAA TGCCAAACTG
CCCTACAACC AAACCTACTA CCAAAAAGAG GACGGGACTG AACTGGTGGT TGGCAATTGC
CTCACCTGTC ATGGCGGAGT CTTCAATGGT GAGCTTATTG TGGGTCTAGG CAACGAAGCG
GCGGACTTTA CCGGCGATCC TGCGGCTTCC GCCGAGCGCA TGGGCGTTTA TGTCCAGGGA
GAAAAAGAAA CGGCCGCATG GCGTAAATGG GCCGATCGAA TCGCAGCCAT TGCCCCCTAT
ATGAAAACCG AGGTCATCGG CGCTAATCCA GCGGTTAACC TGACCTGGGC CCTTTTTGCC
CACCGGGATC CGGAAACCCT CGCCTGGTCC CAAGAACCCC TGATAGAACC CCCACCCAAA
GAGCCCGTCC CCTTAAGTGT CCCCCCCTGG TGGCGGATGG AGAAAAAACA TGCCATGTTT
TACTCGGCCG AGGGTCGGGG CGATCATGCC CGGATGATGA TTCTGGCCTC TACCTTCTGC
ACGGACTCAG TGGAAGAAGC GAAAGTAATT GACTCCTATG CTCCCGATAT TCGCGCCTAT
ATTGCTTCTC TTGAACCCCC TGTCTATCCT TTCCCCATTG ACCAGGCACG GGCTAAACAG
GGCCAAACCG TGTTCGAAAC CCACTGTGCT CGCTGCCATG GCACCTATGG TGAAAACGAA
AGTTACCCTA ACCTGGTCAT TCCCCTAGAG GAAATTGGCA CGGACCCAGA ATATATCCAA
TCCACCATGG GGAAACAACT GGATCGTTTC GGTCATTGGC TGGCGCAATC TTTTTATGGT
GAAAATTCCC ATTTTAATCT ACCTGCGGGG GGCTATCTTG CCCCGCCCCT GGATGGAGTT
TGGGCCACCG CTCCTTACCT GCACAATGGT TCCGTTCCCA CCATTGAAGC CTTGCTGAAA
AGCGACCTGC GGCCTACCTA TTGGACCCGC TCTTTTGATT CCAAGGATTA CAATGATAAA
GCCCTGGGCT GGCATTACAC AGAGCTCACC TACGGCAAAG AAGGCGCAAA AGACGCTGAG
CAACGCAAGC GCCTCTATGA TACGACGCTG CGAGGATATG CCAATGGAGG CCATACGTTT
GGAGATCAGT TAGCAGAAGA AGAGCGGCGC CAGGTTTTAG AATATCTAAA AACTTTGTAG
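
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC percentage. It assumes GC content means the share of G and C among all bases; the database may count or round differently. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in seq (assumed definition)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as printed above.
first_line = "GTGCAGCTTA AAAATCGTAT TTACCGGCCA TGGGCCACAG TCGGTTTGGT TTTTTTATTG"
seq = clean_sequence(first_line)

print(len(seq), "bases")
print(round(gc_percent(seq), 1), "% GC in the first line")
```

Passing all 24 printed lines to `clean_sequence` in place of `first_line` gives the value for the whole gene, which can then be compared with the reported GC content.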
 
Protein sequence
MQLKNRIYRP WATVGLVFLL TACGSGEESK PESSSTIKTA STGFVPPEPQ RQGDPEAGYD 
AIINKAYVTC GIPYDAYKKS DLDRPPEYAL PGREEPNAKL PYNQTYYQKE DGTELVVGNC
LTCHGGVFNG ELIVGLGNEA ADFTGDPAAS AERMGVYVQG EKETAAWRKW ADRIAAIAPY
MKTEVIGANP AVNLTWALFA HRDPETLAWS QEPLIEPPPK EPVPLSVPPW WRMEKKHAMF
YSAEGRGDHA RMMILASTFC TDSVEEAKVI DSYAPDIRAY IASLEPPVYP FPIDQARAKQ
GQTVFETHCA RCHGTYGENE SYPNLVIPLE EIGTDPEYIQ STMGKQLDRF GHWLAQSFYG
ENSHFNLPAG GYLAPPLDGV WATAPYLHNG SVPTIEALLK SDLRPTYWTR SFDSKDYNDK
ALGWHYTELT YGKEGAKDAE QRKRLYDTTL RGYANGGHTF GDQLAEEERR QVLEYLKTL
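
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the accepted start codons, and an initiating codon is translated as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a minimal example rather than the database's own method, and the function name `translate_cds` is hypothetical.

```python
# Standard codon-to-residue mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same mapping as the standard code and differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, reading an initial start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiating codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate_cds("GTGCAGCTTA AAAATCGTAT TTACCGGCCA"))  # MQLKNRIYRP
```

The output matches the first ten residues of the protein sequence. The same function stops at the terminal TAG, which is why the 1440 bp gene yields a protein of 479 residues.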