Gene Noc_1833 details

Gene Information

Locus tag: Noc_1833
Symbol:
ID: 3705077
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2082240
End bp: 2083322
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 45%
IMG OID: 637738314
Product: hypothetical protein
Protein accession: YP_343831
Protein GI: 77165306
COG category:
COG ID:
TIGRFAM ID:
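
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon (both consistent with the values listed here).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and line breaks (as in the blocked sequence layout) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)
```

With the values from this record, `gene_length(2082240, 2083322)` gives 1083 and `protein_length(1083)` gives 360, matching the Gene Length and Protein Length fields. Passing the full gene sequence to `gc_percent` should give a value close to the listed GC content, subject to how the database rounds it.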


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAACTA TCGCTTACTT TATCACCTCC CATGGTTTTG GCCATGCGGC GCGAGCCTGC 
GCGGTCATGG ACGCATTACG GAAAAATCTT TCTAAAATCC AATTTGAAAT TTATACTCAG
GTTCCCCCTT GGTTCTTCCA AGGCTCCCTA CAGGGAAATT TTAATTATCA TGCCTTGCTA
ACCGATATTG GCATTGTTCA GAAATCGCCC TTACATGAAG ATTTGAAGCA CACGCTACAT
GCTCTTGACC GCTTCCTTCC TTTTAACCCC TCTCAAGTGG TTAGACTTGC CAAGCAAATC
CAAAAAAACC AATGTGAGCT TATGCTATGC GATATCTCGC CTTTGGGAAT CACAGTAGCC
AAGGAAGCAG AAATACCGGC AGTTCTCATT GAAAACTTCA CCTGGGATTG GATTTACCAA
AGTTACCTTA GCCAGCATCC AGAAATCGGC AAATATATTG ATTTTTTTAA GGGTTTATTT
GAAACAGCGG ATTATCATAT TCAAACTCAA CCAGCTTGCC ACCCCACACC GGTTGATTTA
ACCACTCCGC CCGTTTCCCG AAAAGCCAGA CTTTTTAGGC AACAAATTCG AAGCAAACTC
AGCCTGCCTC AGAATGCTCC AGTTATTTTA ATTAGCATGG GCGGCATTCC CCCAGGACAC
TACCCCTTTC TAGCGCAAAT AGCTGCCCAG CAAGCCCTGT TTTTTATCAT TCCCGGGGTG
GGTGAGAGTC TACAACGTCA TGAAAATATA GTTTTTTTAC CCCACCATTC CGAATTTTTC
CATCCGGATT TAATCCAAGC CTCGGATCTG GTATTGGGAA AGCTAGGCTA CAGTACCCTG
GCCGAAACCT ACTGGGCGGG CATCCCATTT GGCTATATCA TCCGAACCCA TTTCCGGGAA
TCAGAGATAC TCGATAAATT TGCGAAAAAA GAGATGGACG GTTTTGCCAT GAATGAAACC
CAATTTAACA GGGGCGATTG GATTTCCCAA TTACCTTATT ATCTCAATTT ACCTTCCCGG
GAGCGTTTTG GCCCTAATGG CTCTGAGCAA ATAGCTCGCT TTATTCTCTC TTTGTCACCA
TAA
 
Protein sequence
MKTIAYFITS HGFGHAARAC AVMDALRKNL SKIQFEIYTQ VPPWFFQGSL QGNFNYHALL 
TDIGIVQKSP LHEDLKHTLH ALDRFLPFNP SQVVRLAKQI QKNQCELMLC DISPLGITVA
KEAEIPAVLI ENFTWDWIYQ SYLSQHPEIG KYIDFFKGLF ETADYHIQTQ PACHPTPVDL
TTPPVSRKAR LFRQQIRSKL SLPQNAPVIL ISMGGIPPGH YPFLAQIAAQ QALFFIIPGV
GESLQRHENI VFLPHHSEFF HPDLIQASDL VLGKLGYSTL AETYWAGIPF GYIIRTHFRE
SEILDKFAKK EMDGFAMNET QFNRGDWISQ LPYYLNLPSR ERFGPNGSEQ IARFILSLSP