Gene Noc_0621 details


Gene Information

Locus tag: Noc_0621
Symbol:
ID: 3706853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 668471
End bp: 669730
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 53%
IMG OID: 637737129
Product: hypothetical protein
Protein accession: YP_342670
Protein GI: 77164145
COG category:
COG ID:
TIGRFAM ID:
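
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 668471
END_BP = 669730


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1260 419
```

Under these assumptions the result agrees with the listed Gene Length (1260 bp) and Protein Length (419 aa).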


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGATG AAGAGGAAAC ACTGCGGCCG GAGACCATTG CCCTCGATGC GTTAATCAAG 
GCGACCGTTG CCAAGATTCG TGAACAGACT GATCCGTCGC ACCCGGACGC CATGTTGTTT
ATGGGTAATT GGCACCAGGC CGTTCCGGCA ATGGTCATTC AGGATCCGGT GCTGGAACCC
GTCGATAAAC TGGTGTGGAT GGTCATCATG CTGCATGCCA GAGAGACTGG CGGGCGAACG
GCATTTCCCG ACTACGATAC GATCGCCAGT AAAACCAATG TTTCCTCAAC CTCCACGGTC
TCGCGCGCTA TTGCGATCCT GCGTTTGACG CGCTGGCTGA CCCTGTGCGC CCGGATCCGG
CAAACGAGCG GCCGCTTCAC GGGTAACGTC TACATACTCC ACGATGAGCC GTTGCCACTG
GTCGATGCCA TCTATCTGGA CGATGCCTAC ATGGCGTTCG TGACTCAATC TCAGGAGCAC
CATCACGCGC GTGTCCGCCG CGTGGCTCAG GCTGTGACAG CGAGTCTCGA TATGGATATT
CGTCGGGGTG AACATTTGGC GGACCAGGAA TCAGCGATCG AGCGTCGATT ACAGGCAGTG
AAGATGCTGG CGGATACCAG CAGAAATAAC GACAAAAGCG GTCGCTATTT TACCTTCAAC
GCAGCGGCAC TAAGCCAGCT GAAAAATTCG TCAGACACTG GAATTGCAGA GCAATCAGAC
CAGCACCAAT TTTCAAAGGC GGAGACGAAG ACGCACTACA GTAGTGGTTG TAGTAGTCAT
TATAAAAAAA CAACTACAAC AACCACTACA CAAAATACCC ACAATGAAAA GAAAGCATTC
TCCGAATCCA GCCAATCAAT ACCGTTGCCA ACGGATCAAA CGCTGATCTA TCCACCACGT
TTGTCGGAGA ACCAGAAACT CCTGGCTGAT AGGTATCTTG CGATGATTGC GCCCGAAGAC
CGGCAGTTGG TGCTGGATGA ACTGCAAGGC CGCCTGTCCT CTGAGCAAAA GGGTATGAAG
CCCGTCTACG ACGAACTGAG GTTTTTGCAC TCGCTGTGCA AGGCTGCGCA AAAAGATGAA
TTTGTGCCTA ACCTGGGCAT CAAGGTGGCG GAGGCTCGAA AAGAGCGGGT GCGTCATGTT
CAACCGCCGG AAGATGAAAC GCAGAAAGCC CAAACCGCCG AAGAACGAGA ACGCTCCCAG
GCCTATGCGC GTGAGCAACT GGCCAAGTTG CGCGCATCGT TGAACATGGA CAAAAAATAA
 
Protein sequence
MADEEETLRP ETIALDALIK ATVAKIREQT DPSHPDAMLF MGNWHQAVPA MVIQDPVLEP 
VDKLVWMVIM LHARETGGRT AFPDYDTIAS KTNVSSTSTV SRAIAILRLT RWLTLCARIR
QTSGRFTGNV YILHDEPLPL VDAIYLDDAY MAFVTQSQEH HHARVRRVAQ AVTASLDMDI
RRGEHLADQE SAIERRLQAV KMLADTSRNN DKSGRYFTFN AAALSQLKNS SDTGIAEQSD
QHQFSKAETK THYSSGCSSH YKKTTTTTTT QNTHNEKKAF SESSQSIPLP TDQTLIYPPR
LSENQKLLAD RYLAMIAPED RQLVLDELQG RLSSEQKGMK PVYDELRFLH SLCKAAQKDE
FVPNLGIKVA EARKERVRHV QPPEDETQKA QTAEERERSQ AYAREQLAKL RASLNMDKK
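
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool used to produce this record. It assumes the sequence is given on the coding strand and relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
# Translation table 11 shares these assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring whitespace.

    Stop codons are returned as "*". A codon containing a character
    other than A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence, spaced as in the listing.
print(translate("ATGGCCGATG AAGAGGAAAC ACTGCGGCCG"))  # prints: MADEEETLRP
```

Passing the full gene sequence to `translate` and removing the trailing `*` would give a string to compare against the listed protein sequence.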