Gene Noc_2530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2530 
Symbol 
ID3704686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2882118 
End bp2883986 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content52% 
IMG OID637739009 
Producthypothetical protein 
Protein accessionYP_344513 
Protein GI77165988 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000044535 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAT GGTTTCTCTA CAACAGTATC GCATTAGTAC TATTCGGGCT ATCCATGGGG 
AGCTATGGAC GCGAAATCCA AGTGCCTATA CAACTCAATA ATGAGCTGCT TCGCCATATT
CTCATTAGGG AAGTTTATGT CGGTCCCCAT CATACGGCCC AGGTCTGGAA TGATGATAGC
GGCTGCAATT CTTTAGTGCT TTCCAATCCC CGGGTCGGTA ATGCCGGCCA GCAGCTTCGC
ATTCTCAGTG ACGGAATAGC CAAGCTCGGA ACGCCCATTG GCAATCGCTG TATTCCACTC
CTAGACTGGA CAGGAACGAT TGAAGTATTC CAAAAGCCCA TGCTTGGACC CGAACTGACT
ACCCTCTACT TCCAAACTGT ACAATCCAAT ATTTATAACG CCGAAGGGCA TAAGGAAGCT
GCCACGGGTC AGCTCTGGGA TAGGATCAAA GAATATGTAC ACCCCAGGCT CTCGCAGGTC
CGGATTGACC TCCAGCCCCA GTTAGCGGAA CTGCGAAATC TACTGCCCTT GGTATTATCT
CCCCGGGACC GCTCACGGAT TCAGACCGCC ATTGATTCCC TAACCCTGAC CGAAGCACAA
ACGACCCCGG ACGGGATTAA GGTCGCCTTG CGTTTTACAC TCCCGGATCT CAACACTCCC
CCTCCTTCTC CGGAACCGCC CCTCTCACCT GAGGAAATGC AGCGCTGGGA AGCAGCGTGG
CAGCAAGGAG ATGCGTTTCT AACCTTTATT ATCAAGCAGG CAGCAGCTGA AAATGAGTTA
GCAGAATTAC GCCCGCTTTT GTTGGAAATT CTTTTAGACG CCCGTCACGA TATAGACAAG
GCCCTCACTG CTTCAACGCC TGGAACGGCC GATCCCATCC GCACGCTGTT TCTAAAAACC
TGGGAACGCC TGGCACCGGT ATTGCGCCAT TTAAGCCTGA GCATGCCCCA TGAAACGGCC
CTGCACTACC TGAGCTTTAT TGCCGCCAGT GATGTCCTAA AAACCATTGA CCAGCTTGGC
CCGGCGTCTG GCCTCGATAT CTCCACCGAT GGGTTACGCC GGCTAGCCCG GATCATTGCT
CCCCAAGCGG GGCACCATCC CTTGTTTTAC AACTTTAAGG TCGATCCTGA GTTGCGGCGC
TTATTGGGAT TTAGCGTGGC TCCATCCCCT TCTCGGAAAA ACTCGCAACT CAACCTGAAC
GAGGGGTTAT GGCGAAATGC CTGGGCTGCT GATAGCGTTG ATCGACCCCT GATTTCACGG
CTCAACCAGT GGGTTCCCAC GACAAAGGAT ATGGGAACCT ACTTACCAAT GGTCCACCAG
CTCCTGGATC AAACCGTCAG CCATTTACTC CAAACTCATC CCTTGGAAAG CCAATACCAC
TCACTGTACC GCTGGCTCCT GCTGGCTACC GCTTGGCAAG AAAGCTGTTG GCGTCAATTC
ACCAAGAAGG GAGACAAGAT CCGACCGTTC CACTCTGGTG GAGGCTCGGT AGGTCTCATG
CAGATCAATC AAAACGTCTG GCGAGGTTTT TACGATGTGC ATGACTTGAA CTGGGACATT
GCCTACAATG CGCAAGCGGG GGGAGAAATC TTGTTGCGCT ATCTGGTAGA TTACGCCATC
AAAAAAGGCG AGCATAAAAA AACAGGTGAT CTCGATAATC TAGCTCGCGC CACCTATGCC
GCTTATAATG GAGGACCAGG ACACTTGAGG CGTTACCGTA AGGCAGGTAC GCCAGAGTCC
TTGCGTAAGA TAGATGCTTC TTTTTGGGAC AAATACCGAA CCATCAAACA GGGTAACGAA
CTGGCTGTCG CCCAATGCTT TGGTATAGAA GCCTCGTCTT TATCGCTTCC GCCAGGAAAC
AGAAGATAA
 
Protein sequence
MKKWFLYNSI ALVLFGLSMG SYGREIQVPI QLNNELLRHI LIREVYVGPH HTAQVWNDDS 
GCNSLVLSNP RVGNAGQQLR ILSDGIAKLG TPIGNRCIPL LDWTGTIEVF QKPMLGPELT
TLYFQTVQSN IYNAEGHKEA ATGQLWDRIK EYVHPRLSQV RIDLQPQLAE LRNLLPLVLS
PRDRSRIQTA IDSLTLTEAQ TTPDGIKVAL RFTLPDLNTP PPSPEPPLSP EEMQRWEAAW
QQGDAFLTFI IKQAAAENEL AELRPLLLEI LLDARHDIDK ALTASTPGTA DPIRTLFLKT
WERLAPVLRH LSLSMPHETA LHYLSFIAAS DVLKTIDQLG PASGLDISTD GLRRLARIIA
PQAGHHPLFY NFKVDPELRR LLGFSVAPSP SRKNSQLNLN EGLWRNAWAA DSVDRPLISR
LNQWVPTTKD MGTYLPMVHQ LLDQTVSHLL QTHPLESQYH SLYRWLLLAT AWQESCWRQF
TKKGDKIRPF HSGGGSVGLM QINQNVWRGF YDVHDLNWDI AYNAQAGGEI LLRYLVDYAI
KKGEHKKTGD LDNLARATYA AYNGGPGHLR RYRKAGTPES LRKIDASFWD KYRTIKQGNE
LAVAQCFGIE ASSLSLPPGN RR