Gene Noc_1379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1379 
Symbol 
ID3706104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1528060 
End bp1529241 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content52% 
IMG OID637737874 
Producthypothetical protein 
Protein accessionYP_343403 
Protein GI77164878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAATA GGGACTCGAC GGAACAGCAA TTGCGTACCG AAGCCGATGC CCTGTGCCGG 
GATCATCGCT TCAGGCGGGC GATTTGGGAT CACTTTCAGC GGCCAGGTAA TAAAAGCATG
GGGATGGCGC TCAATACCGC GATCCATCCC AATGATCAAA TGCTTATCCA TTCACTGCGC
CACCATCGGG ATCCCAATGC GGCCTTGAGT CAGTATTACA ATGTGGCGCT GCAGCAACAT
TTTGCCGCCC AACAGATTCT GCAAGCCTTT TTTCCAAACC CTGGGCCAGA TTTCGCTTTT
CTAGATTTTG CCTGTGGATT TGGCCGGTTA GTTCGATTGC TGACTCTCAG TTTGCCCGCA
GCTAATATCT GGGTCGCGGA AATACAAAAA GATGCCCTTG CTTTTGTCAC CCAAACATTC
AATGTTCAGG CACTGGAGTC CAGTGCCAGT CCGGAGCAAT TCCAAGCCGG GAGGAAATTT
GATTTTATTT GGGTGGCTTC TCTTTTCTCC CACCTGCCGC CAGGGCTATT CCAGCGCTGG
CTGGAGCGGC TTCTGTCCTT GCTGAACCCA AGTGGCATTC TCTGTTTCAG CGTCCATGAT
CAAGCGCTTT TGCCTGTGGG AGTCGGCCTG CCGGAGGTAG GTATTCTATT TAATCCCCAC
AGTGAAAATG CTGAATTGGA CCCTAAAACG TATGGTACCA CCTTCGTCGG CGAAGATTTT
GTAAAGCGTG CCATCCACGA GGTAAGCGGT GAGGGCCATC CCTATTTTCG TATTCCCAAG
GGGTTGGCCC AGGAGCAGGA TCTCTATGTG GTGGCTAAAT CTTCGAGTTT GAATCTATCA
GGATTGCAAG CCTTTCGTTA TGGTCCCTGG GGCTGGGTGG ATGAGCGGCG TATTTTAGAG
TCGGGTGAGT TATATCTGCG TGGCTGGGCT GCCTCTCTGG ATGATGGGGT ATTGCCTGCC
GTTGAGATCA AGGTCAATGG AACTTTTCAT CGCTGCCCTA CCGGGTTGCA GCGCAAGGAT
GTCTGTCAGG TTTTTAGGGA TGATCGTTTA GAATCCGCTG GCTGGGAGTT TAGCTATCCG
CTGGATAGTA AAATACGGGA AGTTTGGGTA GAAGTCACGG CGAGGACAAT TGCTAACGAA
AGGGCCTTGC TCTACGGAGG AAGCCTTTCC CGTTCAAGCT GA
 
Protein sequence
MENRDSTEQQ LRTEADALCR DHRFRRAIWD HFQRPGNKSM GMALNTAIHP NDQMLIHSLR 
HHRDPNAALS QYYNVALQQH FAAQQILQAF FPNPGPDFAF LDFACGFGRL VRLLTLSLPA
ANIWVAEIQK DALAFVTQTF NVQALESSAS PEQFQAGRKF DFIWVASLFS HLPPGLFQRW
LERLLSLLNP SGILCFSVHD QALLPVGVGL PEVGILFNPH SENAELDPKT YGTTFVGEDF
VKRAIHEVSG EGHPYFRIPK GLAQEQDLYV VAKSSSLNLS GLQAFRYGPW GWVDERRILE
SGELYLRGWA ASLDDGVLPA VEIKVNGTFH RCPTGLQRKD VCQVFRDDRL ESAGWEFSYP
LDSKIREVWV EVTARTIANE RALLYGGSLS RSS