Gene Noc_1795 details


Gene Information

Locus tag: Noc_1795
Symbol:
ID: 3705312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2023839
End bp: 2025881
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 48%
IMG OID: 637738279
Product: hypothetical protein
Protein accession: YP_343796
Protein GI: 77165271
COG category:
COG ID:
TIGRFAM ID:
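
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp):
    """Codon count minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2023839, 2025881)
print(length)                           # 2043
print(expected_protein_length(length))  # 680
```

Both results agree with the Gene Length and Protein Length fields listed for Noc_1795.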


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0505644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCTACA CAGCGAAAAA TCAGGGCACA GCGGAATCCT TGGAGGACGA CCCTTCTATT 
TTAACGTGGG AACAAATCCT GGAGCAAGAG CGTATGCGCA TCAGGGAGAG GCGCCGCCAG
GCGGGAATTG CTCTAGGCCA GCCAGAAAAA GATGCGGTAG GGCTGGCTTT TTCGGGTGGC
GGGATTCGCT CTGCCACCTT TAATCTGGGG TTATTGCAGG CGATGAATCG CTACAGTTTT
CTTAAGCATG TGGATTATCT TTCCACTGTC TCGGGCGGAG GTTATATTGG CAGTTCCCTG
ACTTGGTTTA TGTCTTGCCT CAAGCAAGAC TTTCCCTTTG GCGCTTCCCG CCGCGACAAT
AGGGAAACGC CAGGCGCTAT CGTGAGTTGG CTCCGTCAGC ACGCCTCTTA CCTCACGCCG
GGTGAGGGGG TGGATCTTTG GGCCTTGGCA GCGGCTATTA TACGGGGCAC CTTGGTTAAC
TTGCTGGTGA TTATTCCGAT TTTCTTTACT ATAACCGTGC TCTTGGTTTG GTTGCCCGTC
CCAGTGGTTA TTCCAGGTTA TCCTTGGAAT GGTTTTACCC TGTTGCTAGG AGCAGGCCTT
GCCTCTCTCG CTCTGCTTGG CGTCACTTCC ATTTTCTATG CTCTATTTTC TAATGTTAGA
AGCTTACAGC GATTCCGTGT ACGGAGCCGG AGCAATTTCT GGATGGGACG GATGCTGTTT
TTTGGAATAG GATTTGCGGT GCTCGGTACG ATTCCTCTCT TGCATGGTTA TCTGGAGAGT
CATTTTAAGG ACTTGATCGA AGAATTTTAT ACCTCTTTTT CCCTGGCTGG GGCGCTATCT
CTCATGGGTG GCTGGATTGG CCGGGATTCG GAAAACGAGA CTCAGGGCTA TCGCAAGGTT
TTGTTAAATG TGGGACTAGC TTTGATAATT TATGGTCTTT TGCTATGGAT GTACCATGAT
GCTGATGCCG TTGTGAACAA GGGTGATGTA GTGAGAGAGG AATTGTTATG GGCAGGCGTG
GCCTTATCGC TTTTTATTGG TGTGCTAGCC AATATTAACT ATGTCTCTAT CCACCGTTAT
TACCGGGATC GGCTGATGCA AACATTTATG CCGCCGGTGG GGTTCACTGA TTTTAGGGAA
CCCAATACAT GTTTGTTAAA AGACATTCCC CAAACCAAAG CCCCCTACCA AATTATTAAT
ACTATGATGA TGACCTGGAA TTCTTCCACC CCCGCACTGC GGATCAGAGG AGGGGATAAT
TTTATTTTTA CTCCTTTATT TTGTGGCGCC CCGTCCACCG GCTATGTTCC AAGTGCCCAA
TACCTGGGTG GCACGATGGA TCTGAGCACC GCCTTTAGTA TTTCGGGAGC TGCCATCGAT
CCCAATACGG GAGTAACTCG ATCCCGGCCA TTATCTTTTA TGATGACTTT ATTGAACTTA
CGGATAGGTT ATTGGGTTCG TAATCCAAAG CGGCCTGCCA ATAGGATAAA GGGGTGGTCG
CGTCCTTATT GGTTTGTCTA CTCCTTGCGG GAAATGCTTG GCCTTAAAAT GGCTGAGAAT
CAAATGCATG TTTATTTGAC CGACGGAGGC CATTTTGAAA ATTTAGGTCT TTATGAATTG
GTGAGGCGCC GATGCCGTTA TATTGTGCTT TCCGATGCGG CTGAAGATCG GGCTTGGAAG
TTTGATGATT TAGGGAATGC CTTGGAAAAG ATTCGGGTAG ATTTTGGTGT AGCTATCGAT
ATTGATACTC AAATGCTACA ACCTCAGGGT CTTAATCAAT TTTCTCCTCA GCCGGGAGTG
CTAGGAAATA TTTGCTACGC GGATGGGAGT CGAGGGACTT TACTTTATAT CAAGGCTTCC
GTTTTTTCCG GACTTCCAGA GGATGTCTAT GCCTACCGGC GGGCTAATCC CAAATTTCCC
AATCAAAGTA CGGTTGATCA GTTTTTCGAC GAGCCCCAAT TTGAGGCTTA CCGGGAGTTA
GGCTTTCAAG TAGGCAAGCG AATATTTGAG GATAAAAAAC TCCGTAAAAT TTTTGCTTCC
TGA
 
Protein sequence
MSYTAKNQGT AESLEDDPSI LTWEQILEQE RMRIRERRRQ AGIALGQPEK DAVGLAFSGG 
GIRSATFNLG LLQAMNRYSF LKHVDYLSTV SGGGYIGSSL TWFMSCLKQD FPFGASRRDN
RETPGAIVSW LRQHASYLTP GEGVDLWALA AAIIRGTLVN LLVIIPIFFT ITVLLVWLPV
PVVIPGYPWN GFTLLLGAGL ASLALLGVTS IFYALFSNVR SLQRFRVRSR SNFWMGRMLF
FGIGFAVLGT IPLLHGYLES HFKDLIEEFY TSFSLAGALS LMGGWIGRDS ENETQGYRKV
LLNVGLALII YGLLLWMYHD ADAVVNKGDV VREELLWAGV ALSLFIGVLA NINYVSIHRY
YRDRLMQTFM PPVGFTDFRE PNTCLLKDIP QTKAPYQIIN TMMMTWNSST PALRIRGGDN
FIFTPLFCGA PSTGYVPSAQ YLGGTMDLST AFSISGAAID PNTGVTRSRP LSFMMTLLNL
RIGYWVRNPK RPANRIKGWS RPYWFVYSLR EMLGLKMAEN QMHVYLTDGG HFENLGLYEL
VRRRCRYIVL SDAAEDRAWK FDDLGNALEK IRVDFGVAID IDTQMLQPQG LNQFSPQPGV
LGNICYADGS RGTLLYIKAS VFSGLPEDVY AYRRANPKFP NQSTVDQFFD EPQFEAYREL
GFQVGKRIFE DKKLRKIFAS