Gene Noc_2370 details


Gene Information

Locus tag: Noc_2370
Symbol:
ID: 3704810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2718002
End bp: 2718979
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 55%
IMG OID: 637738853
Product: mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
Protein accession: YP_344358
Protein GI: 77165833
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1705] Muramidase (flagellum-specific)
TIGRFAM ID: [TIGR02541] flagellar rod assembly protein/muramidase FlgJ
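
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2718002
end_bp = 2718979
gene_length_bp = 978
protein_length_aa = 325


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 978
print(expected_protein_length(gene_length_bp))  # 325
```

Both results agree with the Gene Length and Protein Length fields of this record.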


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCAAT CGTCTGCGCT TTTGCCCTTT TATGGCTATT CCAAGAATGG CTTTGCCGAG 
CTTCGTCAAA CCGCCCGGCA AGACCCCACG GACCCGGACA CCCTGCGCCG GGTTGGGACT
CAGTTTGAGT CCTTTTTGAT CCAGACCATG CTAAAGAGCA TGCGGGAAGC GGGCCAGGGA
GGTGGGGTGC TGGATAATAA GCAAGTCGAT TTCTACCAGG GACTTTTTGA TCAGCAAATT
GCTTTTGAAA TCGCCCGTCA TGGGCGTTTG GGGCTGGCGG ATAGAATTGC CGCCCAATTA
GGCAATGGGA CAGCCACTGA GAAGTTGCCT GTCACCACCC AGAATTTTGC GCCTCCCCGC
CGTCAGATGG AACAGCCATC GTCCATCGAT CCCAATGCGT TGCCGGCATT CTCCCGTTTT
GAAAAAATAG AAAAACCATC CGCCGTTGAG GCTGAGGCGT TGCCGACGCT TCCCCATTTT
GAAACAGCCG AGGATTTTGT CCGAATCCTT TGGCCCCATG CCCAGCGAGC TGCCCAATCT
CTGGGATTGG ATCCTCGACT GCTGCTGGCC CAGGCCGCAT TGGAAACAGG TTGGGGCAAG
CAAATTATCC GCACCGAGGC ACAAGGAAGC AGCCATAACT TATTTAATAT CAAAGCAGAC
AGCCGCTGGC AGGGTCCAGC AACCCAGATC AGCACCCTCG AATACCGGCA AGGGGTGGCG
GTGCGGGAGC AGGCTCCTTT TCGGGCTTAT GAGTCCTTTG ACCAGAGTTT TAACGATTAT
GTTGCTTTTC TGCGGCAACA GCCCCGCTAT CATCACGCCC TCACTCAAAC CCATAGCGCG
ATAGACTTTA TGCACTCCCT AGCCGAGGCG GGCTATGCTA CCGATCCGGC CTATGCGGAC
AAAGTGCTGC GGGTTTTCAA GGGCGGGACT CTAAGTCAGG CTTTGGAGAA GTCAAGGATA
GCGGTAGAAG ACCGTTAA
 
Protein sequence
MVQSSALLPF YGYSKNGFAE LRQTARQDPT DPDTLRRVGT QFESFLIQTM LKSMREAGQG 
GGVLDNKQVD FYQGLFDQQI AFEIARHGRL GLADRIAAQL GNGTATEKLP VTTQNFAPPR
RQMEQPSSID PNALPAFSRF EKIEKPSAVE AEALPTLPHF ETAEDFVRIL WPHAQRAAQS
LGLDPRLLLA QAALETGWGK QIIRTEAQGS SHNLFNIKAD SRWQGPATQI STLEYRQGVA
VREQAPFRAY ESFDQSFNDY VAFLRQQPRY HHALTQTHSA IDFMHSLAEA GYATDPAYAD
KVLRVFKGGT LSQALEKSRI AVEDR
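
The gene and protein sequences above can be compared by translating the nucleotide rows. The following is a minimal sketch, assuming the amino acid assignments of the standard codon table, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last rows of the gene sequence are used in the example.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and the final row of the gene sequence shown above.
print(translate("ATGGTTCAAT CGTCTGCGCT TTTGCCCTTT"))  # MVQSSALLPF
print(translate("GCGGTAGAAG ACCGTTAA"))               # AVEDR
```

The two outputs match the first ten and last five residues of the protein sequence. The final row starts in frame because the 960 bases before it are a multiple of three. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.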