Gene Noc_2938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2938 
Symbol 
ID3706420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3326815 
End bp3327924 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content57% 
IMG OID637739415 
Productchorismate synthase 
Protein accessionYP_344913 
Protein GI77166388 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGGCA ATACTCTTGG CAAACTTTTT ACTGTTACCA CCTTTGGCGA AAGCCACGGG 
CCGGCGTTGG GCTGTATTGT AGATGGCTGC CCCCCGGGCT TGGCTTTATG CGAGACGGAT
ATTCAAATTG ACCTGGATCG GCGCCGGCCC GGTAAATCCC GTCATACTAC CCAGCGCCGG
GAACCGGATC AGGTCCAGAT CCTCTCCGGG GTGTTCGAGG GAAAAACCAC CGGCACTCCC
ATCGGTTTGT TAATTGAAAA TGTCGATCAA CGCTCCCGGG ATTACGATAA AATCAAGGAG
CAAATCCGGC CCGGCCATGC AGACTATACT TATTTGCAAA AATATGGCCT GCGGGATTAC
CGGGGAGGGG GGCGTTCCTC GGCCCGGGAG ACGGCCATGC GGGTAGCTGC GGGCGCTATC
GCCAAGAAAT ACCTGGCAGA GCGGCATGGC GTAAAAATTC GGGGGTATCT GGCTCAGCTT
GGCCCCATTC GGGCTGAACG ATTCGACTGG GAAATCGTGG AGAAAAATCC CTTTTTCTGC
CCGGACCCGG ATAAAATATC TGAACTTGAA GCCTATATGG ACGCTCTCCG GAAAGAAGGC
GATTCCATTG GCGCCCGGAT TAACGTGGTG GCTACCGGAG TCCCTCCCGG TCTGGGCGAG
CCGGTCTTTG ATCGCCTCGA TGCGGATTTG GCCCATGCCC TTATGAGCAT TAACGCCGTT
AAGGGCGTGG AAATCGGTGT TGGTTTCGCC GCAGTGACCC AAAAGGGTAC TGACCATCGC
GACCCCCTTA CCCCGGAAGG TTTCCTCAGT AACCATGCGG GCGGTGTCTT GGGGGGGATT
TCCACGGGGC AGGATATTCT TGCTAGCATT GCGCTAAAAC CCACTTCCAG CCTCCGCTTA
CCCGAGCGTA CCATTAACTG CCGGGGCGAG TCTGCGGAAG TCGTTACCAC GGGCCGCCAT
GATCCTTGTG TTGGCATTCG GGCAACGCCT ATTGCCGAAG CCATGGCTGC CTTGGTGTTG
ATGGACCATC TGCTGCGCCA CCGCGCCCAA AATATGGACG TTCAGCCGAG TTTGCCATCC
ATCCCCGCTT ATCCCGGTGG TGGCGGCTAA
 
Protein sequence
MSGNTLGKLF TVTTFGESHG PALGCIVDGC PPGLALCETD IQIDLDRRRP GKSRHTTQRR 
EPDQVQILSG VFEGKTTGTP IGLLIENVDQ RSRDYDKIKE QIRPGHADYT YLQKYGLRDY
RGGGRSSARE TAMRVAAGAI AKKYLAERHG VKIRGYLAQL GPIRAERFDW EIVEKNPFFC
PDPDKISELE AYMDALRKEG DSIGARINVV ATGVPPGLGE PVFDRLDADL AHALMSINAV
KGVEIGVGFA AVTQKGTDHR DPLTPEGFLS NHAGGVLGGI STGQDILASI ALKPTSSLRL
PERTINCRGE SAEVVTTGRH DPCVGIRATP IAEAMAALVL MDHLLRHRAQ NMDVQPSLPS
IPAYPGGGG