Gene Noc_2931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2931 
Symbol 
ID3705360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3315854 
End bp3317113 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content53% 
IMG OID637739408 
Producthypothetical protein 
Protein accessionYP_344906 
Protein GI77166381 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.431501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTCTA ATAAAGAAGC CTTCGTCTGG ATTTGGCTGC CGGAAGAAAC GAAACCCATT 
GTGGCCGGAC GGCTAGAAGC AGACAACGGC CACATTCTGT TTAACTACGG CAAGAGCTAT
CTGGAGCGTA TCGGGGATCA ACCGCCTGCA ATTCCAATTT ACCAACCCGA GTTGCCGTTG
AAGGCCGGAG TATTGCCACT GCCGAAAGGA TTGACCATGC CCGGCTGTAT TCGAGACGCC
TCGCCCGATG CCTGGGGGCG GCGCGTGATT ATCAATAAGC AATTAGGTCT TAAAGGCGCT
GGCACGGATA CGGCCGAGCT GGGCGAATTG ACCTACCTTC TCGAATCCGG CTCTGACCGG
ATCGGCGCGC TCGATTTTCA ACGCTCACCA TCCGAGTACA TATCGCGTAC GGCGAGCAAC
GTCAGCATAG AAGAGCTGAT CGAGTCTGCT GACCGTGTCG AAAAGGGCGT TCCTCTTACG
CCGGAACTAG ATCAGGCATT ATTCCATGGC AGTTCTATCG GCGGCGCTCG GCCAAAGGCT
TTGATCCAAG ATCAAGGCAA GAAGTATGTG GCCAAATTTT CATCCAGCAC CGATCTCTAC
AGCATCGTCA AAGCCGAATT TATTGCTATG CGACTAGCGG CATGGGCTGG ACTTAATGTT
GCCCCCGTTA AACTGGCTAA GGCAGCAAAT AGGGACGTGC TGCTGATTGA GCGATTTGAC
CGTATTCCGC AAGGCAGCGA TTGGTCACGC AAGGCTATGG TTTCCGCGCT CACGTTGCTT
GGCCTCGATG ATATGATGGC CCGTTACGCC AGCTATGAAA CGCTGGCCGA AATTATCCGT
CATCGTTTTA CCGACCCGAA GAATACACTG AAGGAACTTT TCTCTCGGCT CGTTTTTAAT
ATTCTGTGCG GCAATACCGA CGACCACGCG CGAAACCACG CCGCGTTTTG GAACGGCGAG
GCACTGACCC TGACCCCTGC CTATGACATT TGCCCCCAAG GCCGCACGGG CAATGAAGCG
TCACAGGCTA TGTTGATTGC AGGTAACAAC AACCTCAGTC AACTGAAAAC CTGCCTTGAA
ACTGCACACA ATTTCCTACT CTCCGCGGAA GATGCCCAGG CTATCTTTGG AAATCTGACT
GCCGCCATCG AACAGCATTG GGATGCCGTC TGTGAAGAAG CCGAATTGAA TGAAGTGGAT
AAGAGGTTCT TGTGGGGACG ACAGTTTCTA AACCGCTACG CCACGATGAA TCTCAACTAA
 
Protein sequence
MTSNKEAFVW IWLPEETKPI VAGRLEADNG HILFNYGKSY LERIGDQPPA IPIYQPELPL 
KAGVLPLPKG LTMPGCIRDA SPDAWGRRVI INKQLGLKGA GTDTAELGEL TYLLESGSDR
IGALDFQRSP SEYISRTASN VSIEELIESA DRVEKGVPLT PELDQALFHG SSIGGARPKA
LIQDQGKKYV AKFSSSTDLY SIVKAEFIAM RLAAWAGLNV APVKLAKAAN RDVLLIERFD
RIPQGSDWSR KAMVSALTLL GLDDMMARYA SYETLAEIIR HRFTDPKNTL KELFSRLVFN
ILCGNTDDHA RNHAAFWNGE ALTLTPAYDI CPQGRTGNEA SQAMLIAGNN NLSQLKTCLE
TAHNFLLSAE DAQAIFGNLT AAIEQHWDAV CEEAELNEVD KRFLWGRQFL NRYATMNLN