Gene Noc_1457 details


Gene Information

Locus tag: Noc_1457
Symbol:
ID: 3706026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1612979
End bp: 1614130
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 55%
IMG OID: 637737946
Product: YVTN beta-propeller repeat-containing protein
Protein accession: YP_343475
Protein GI: 77164950
COG category: [S] Function unknown
COG ID: [COG3391] Uncharacterized conserved protein
TIGRFAM ID: [TIGR02276] 40-residue YVTN family beta-propeller repeat
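
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1612979
end_bp = 1614130
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, which encodes no residue.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)      # coordinate span matches Gene Length
print(gene_length_bp % 3 == 0)        # CDS length is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop matches Protein Length
```

Under these assumptions all three checks agree with the record: a 1152 bp span is 384 codons, that is 383 residues plus one stop codon.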


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.803154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TTGATGGTTT TATGGTGGCT TTGTTAGCAG GGGGAGCAGG GCTGATGCTA 
AGTGCTACGG ATAGTGTTCT AGCCGCGGAA AGCAGCGGTT CAGGGCAGGA TAATCAAGGG
CTCCAGGCGA AGCTTTATGT CACCCTGGAG GAGCCGGATG CGTTGGGAAT CGTGGATCCA
AAGTCCAAGA AAGGGCTAGG GACGGTTGCT GTCGGTGGGA AGCCCCATGA TGTGATTTGC
GCCCCGGATG GGGCGACGGC CTATGTCACC AATCCAGAAA CCCACAACCT GAGCGTCGTG
GATACAGCAA CGGATAAAGT CAAAAAAACC GTGGAATTTG GCCAGGGAAC GACCCCCTGG
CATGTGGAAA TATCCCCGGA TGGTTCTCAG GTCTTTGCCG CCCTCCAGGA TCAATCAGCG
GTGGCGATTA TTGCTACCGC TGATAATCAT TTAGCGACCA AAGTTTCGGT GACAAGCGGT
CCCTGGGGGG TGGCTGCCCC TAAAAACGGA CCTTGGGCGG TGGCCGCCCC AAGGAATGAT
GTCGTTTACG TCACGCTCAA TGGGAGCATT ACCAAAGGCA CGGCGAATGC AGCCCGCAGT
GAAGATATTG CTGTGTTTGA TCCAACGGCG GCTGTACCTA CCGTGAAATA TGTGACCCTG
GCGGCGGACA CGGCCAATGG ACCCCACGGG ATCGTATCAG CGCCGGATGG ATCGGCGGTT
TATGTCGCCG CCGAGGCAAG TCATGAAGTA TGGAAAATCC AGGTGGACGG CAACCAGGCA
AAGCGGGTAG TCGAAATCCC TGATCCTAAT CCTGCTGGAA CGCCTCTTAA CCCAGGCTTC
CCCACGGATC TGGGCATCAG CCCCGACGGT AATACGCTCA TTGCCGTTAA CCACGACCTT
GACTCCATAA CGGTAATTAA TCTGAAAACC CATAAAATCA TCGAGACGGT CAGCACGGGT
GAAGGCAGCG CGCCCTGGGG AGCGTTGATT TCGCCGGATG GACAGACGGC TTATATCTCC
ACCAACGGCG CGGACAGCCT TGCCTTTTTT TCCATGAAGG AACTCACCGA TGGGACGGAA
GGGGCCCGCA AGCATACCAT CGCGGATTTG CCCACTTCAG ATGGCCTCGC GTGGTGTAAT
TTGGCCCAAT AG
 
Protein sequence
MKKIDGFMVA LLAGGAGLML SATDSVLAAE SSGSGQDNQG LQAKLYVTLE EPDALGIVDP 
KSKKGLGTVA VGGKPHDVIC APDGATAYVT NPETHNLSVV DTATDKVKKT VEFGQGTTPW
HVEISPDGSQ VFAALQDQSA VAIIATADNH LATKVSVTSG PWGVAAPKNG PWAVAAPRND
VVYVTLNGSI TKGTANAARS EDIAVFDPTA AVPTVKYVTL AADTANGPHG IVSAPDGSAV
YVAAEASHEV WKIQVDGNQA KRVVEIPDPN PAGTPLNPGF PTDLGISPDG NTLIAVNHDL
DSITVINLKT HKIIETVSTG EGSAPWGALI SPDGQTAYIS TNGADSLAFF SMKELTDGTE
GARKHTIADL PTSDGLAWCN LAQ
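
The sequence blocks above are printed in groups of ten characters, so they need whitespace stripped before use. The sketch below shows one way to clean such a block, compute its GC fraction, and translate it. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares. The sketch does not handle alternative start codons, which table 11 permits; this gene begins with ATG, so that case does not arise here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
head = clean("ATGAAAAAAA TTGATGGTTT TATGGTGGCT")
tail = clean("TTGGCCCAAT AG")
print(translate(head))  # MKKIDGFMVA, the first ten residues of the protein
print(translate(tail))  # LAQ, the last three residues before the stop codon
```

Passing the full cleaned gene sequence to `translate` and `gc_fraction` would allow the listed protein sequence and GC content to be checked the same way.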