Gene Noc_2963 details


Gene Information

Locus tag: Noc_2963
Symbol:
ID: 3707345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 3350171
End bp: 3351433
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 56%
IMG OID: 637739437
Product: hypothetical protein
Protein accession: YP_344935
Protein GI: 77166410
COG category: [S] Function unknown
COG ID: [COG3864] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
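
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, so both are assumptions.

```python
# Values copied from the record above.
start_bp = 3350171
end_bp = 3351433

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1263
print(protein_length)  # 420
```

Under these assumptions the result matches the listed Gene Length (1263 bp) and Protein Length (420 aa).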


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATA AAATAGCGCC TGCGTCCAAA GAGACCAAGC TCCATACTAA ACTCAGTGCT 
GCCCGAACGC GCCTTATCGT GGAACGGCCT TTTCTGGGGG CTTTGGTGCT GCATCTTCCC
TTACGGGAAG CCAGCCCCGA GTGGTGCGGC AGCACCGCCA CGGATGCCCG CGCCATTTAC
TACAATCCTG CCTATGTGGA ATGGCTTTCC TTCGAGCAGC TTCAATTTAT TCTGGCTCAT
GAGGCCCTCC ACTGCGCTTT ATGCCATTTT GCCCGCCGCG GCCGCCGGCA TCTAGGCCGT
TGGAATGCAG CCTGCGATTA TGCTGTCAAT CAATTACTGG TTCGAGAGGG ATTAGAGCCT
CCGCCTGGGG TGTTGCTCAA GCAAGATTAT CGTGCCCTCA GCGCCGAAGA AATCTATCCT
TTGCTCCCCG TGGGAGCAAA ACTGCAAACG GTTGATCAGC ACGTCTATGA CCACGAGGAC
TCGCCCTCCA ATGCCCCCGC TGATCTCAAT CCAGCCAGTC AAGATTGGAG TATCCATGGT
CCAGAGCGGC ACAACCCGCC CGTCTATGAC GGTCGTCTCC TTCCCTCATC CCAAGTCTCC
GCTTCCGGGT CGCCACCCCC ATTAACCGAA ATGGAACGGG AACAGCTTGG CCGCTTGTGG
CAACAGCGGA CTGTGAGTCT TGCCCAGCAA GCCTTGCAGA ACGGTAAGCT AAGCGGTCCA
TTGCGGCGCC TTATTGGGAA TTTAGGGCGC CCCCAGCTTC CTTGGCGGCA GTTACTTGCG
CAGTACTTGA GTGCGGCTGC CCAGAATGAC TACAGCTTTG CCCGCCCGTC CCGCCGCGAA
GGTCCGGCAA TTTTACCCCG TCTCGCCTCT CAGCAAATAG ATCTGGCGAT CGTACTGGAT
ATCAGCGGTT CCATTCATGA TGAGCAATTG CAGAGTTTTC TTACCGAAGT CAGCGCCCTA
AAGGGTCAAC TTTGTGCCCG CGTAACTCTC CATGCCTGCG ATGCCGACCT GTGCGAACTA
GGCCCCTGGA TTTATGAATC TTGGGAAGGA CTGACCCTGC CAGAAAATTT ACCCGGTGGC
GGCGATACCG ACTTTCGGCC TCCCTTTGCC TGGCAAGCAC GTGAAGGACT CTATCCGGAC
GTTTTGCTTT ATTTTACCGA TGCCCGGGGT CCATTCCCGC AAGCAGAGCC CCCTTATCCC
GTCATCTGGC TGGTCAAGGG TAAAGCCCAA GTGCCGTGGG GGCGAAGAAT ACAGCTTAAT
TAA
 
Protein sequence
MNDKIAPASK ETKLHTKLSA ARTRLIVERP FLGALVLHLP LREASPEWCG STATDARAIY 
YNPAYVEWLS FEQLQFILAH EALHCALCHF ARRGRRHLGR WNAACDYAVN QLLVREGLEP
PPGVLLKQDY RALSAEEIYP LLPVGAKLQT VDQHVYDHED SPSNAPADLN PASQDWSIHG
PERHNPPVYD GRLLPSSQVS ASGSPPPLTE MEREQLGRLW QQRTVSLAQQ ALQNGKLSGP
LRRLIGNLGR PQLPWRQLLA QYLSAAAQND YSFARPSRRE GPAILPRLAS QQIDLAIVLD
ISGSIHDEQL QSFLTEVSAL KGQLCARVTL HACDADLCEL GPWIYESWEG LTLPENLPGG
GDTDFRPPFA WQAREGLYPD VLLYFTDARG PFPQAEPPYP VIWLVKGKAQ VPWGRRIQLN
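
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal Python sketch. It is not the method used to generate this record. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is the standard NCBI layout; translation table 11 uses the same codon-to-amino-acid assignments and differs only in which codons may act as start codons, a case this sketch does not handle (the gene here starts with ATG, so it does not matter for this record).

```python
# Standard NCBI genetic-code layout with bases in TCAG order.
# Assumption: alternative start codons of table 11 are not special-cased.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGATA AAATAGCGCC TGCGTCCAAA"))  # MNDKIAPASK
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` would allow the whole protein sequence and the listed GC content to be compared in the same way.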