Gene Noc_2579 details


Gene Information

Locus tag: Noc_2579
Symbol:
ID: 3704583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2931676
End bp: 2933007
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 53%
IMG OID: 637739059
Product: peptidase M24
Protein accession: YP_344562
Protein GI: 77166037
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
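
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
# Values copied from the record above.
START_BP = 2931676
END_BP = 2933007
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_len = span_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # prints: 1332 443
```

Under these assumptions the computed values agree with the listed gene length (1332 bp) and protein length (443 aa).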


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGCCA AGGAATTCGC ACGCCGCCGC AAACACTTAT TGCAAATGAT GGGAGAAGGC 
AGCATAGCTA TCCTTCCCAC AGCAAGTATT TATCCCCGGA ACCGAGATGT GATGTTTCCC
TTTCGCGCCG ATAGTGATTT CTACTATTTG ACGGGCTTTC CCGAGCCCGA AGCAGTGGCG
GTCTTCGCTC CTGGACGCAA GCACGGTGAA TATCTCCTGT TCTGCCGGGA GCAAGATCCG
GAGAAAGAGA TTTGGGAAGG CCGTCGTGCT GGCACTCAAG GCGCCTGCAA GAATTATGGC
GCTGACGATT CCTTTCCTAT TACCGATATC GACGATATCT TGCCTGGCCT CCTAGAGGAC
AAGGCCCGGG TCTACTACGC CATGGGTTAC TATCCTGCCT TCGATCAACG GATGATAGGG
TGGGTTAACC ATATCCGGCG GGCATCCCGC GCTGGCAAAC GACCGCCAGG GGAGTTTATT
GCCCTTGATC ATCTACTTCA TGAAATGCGC CTGATTAAAA GCGCGCAAGA AATTAGAACC
ATGCGGGAGG CAGCCCGGAT CTCTGCTAAA GCCCACATCC GGGCTATGGA AAACTGCCAT
CCTGGCATAA TGGAATACCA AATAGAAGCG GAGTACCTTC ACCATTTTTT TAGCCACGGC
TGCCGCGCCC CCGCTTACCC TTCCATTGTG GGCAGCGGCG GCAATGCTTG TATTCTCCAC
TATACAGATA ATAATGCCCG CTTAAAAAAA GGCGATCTGC TTTTGGTCGA TGCCGGCGCT
GAGTATGACT ATTATGCTGC GGATATTACC CGCACATTTC CAGTGAGCGG TCGCTTTTCC
TCGGCCCAAA GAGCTATTTA CGAACTCGTC CTAGAAGCGC AGCTTGCCGC TATTGCCGAG
GTCCAACCAG GCAATCATTG GAATCAGCCC CATGAAGCAG CTGTCCGGGT GCTCACCGAG
GGCCTAGCAG CTCTTGGCCT GCTCAAAGGG CGGGTGAGCA CACTACTAAA AAAGGAGCAT
TATCGCCGCT TTTATATGCA CCGCACGGGG CACTGGCTTG GCATGGACGT TCATGATGTA
GGGGATTATA AAGTCGATGG CGAATGGCGG GCTTTTGAGC CTGGCATGAC GCTAACGGTA
GAACCAGGAG TGTATATCCC CGCCGATAGT CAGGGAGTCG CTAAAAAATG GTGGAATATC
GGGGTTAGGA TTGAGGATGA TGTGCTCGTT ACCAAAGAGG GTTGCGAACT TCTGAGTGCA
GATGTCCCTA AAACGGTAGA CGAAATTGAA GCCTTGATGG CTTCCTCCCA GAGAGGAGCG
TCCGCGTCAT GA
 
Protein sequence
MEAKEFARRR KHLLQMMGEG SIAILPTASI YPRNRDVMFP FRADSDFYYL TGFPEPEAVA 
VFAPGRKHGE YLLFCREQDP EKEIWEGRRA GTQGACKNYG ADDSFPITDI DDILPGLLED
KARVYYAMGY YPAFDQRMIG WVNHIRRASR AGKRPPGEFI ALDHLLHEMR LIKSAQEIRT
MREAARISAK AHIRAMENCH PGIMEYQIEA EYLHHFFSHG CRAPAYPSIV GSGGNACILH
YTDNNARLKK GDLLLVDAGA EYDYYAADIT RTFPVSGRFS SAQRAIYELV LEAQLAAIAE
VQPGNHWNQP HEAAVRVLTE GLAALGLLKG RVSTLLKKEH YRRFYMHRTG HWLGMDVHDV
GDYKVDGEWR AFEPGMTLTV EPGVYIPADS QGVAKKWWNI GVRIEDDVLV TKEGCELLSA
DVPKTVDEIE ALMASSQRGA SAS
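
The gene and protein sequences above can be checked for consistency by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first 60 bases and the last 12 bases of the gene are embedded here to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 60 bases and last 12 bases of the gene sequence listed above.
FIRST_60 = "ATGGAAGCCA AGGAATTCGC ACGCCGCCGC AAACACTTAT TGCAAATGAT GGGAGAAGGC"
LAST_12 = "TCCGCGTCAT GA"

print(translate(FIRST_60))  # prints: MEAKEFARRRKHLLQMMGEG
print(translate(LAST_12))   # prints: SAS
```

Both fragments match the corresponding ends of the listed protein sequence. Passing the full 1332 bp gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.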