Gene Noc_2005 details

Gene Information

Locus tag: Noc_2005
Symbol:
ID: 3705195
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2312510
End bp: 2313757
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 48%
IMG OID: 637738482
Product: hypothetical protein
Protein accession: YP_343997
Protein GI: 77165472
COG category:
COG ID:
TIGRFAM ID:
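
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are both inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2312510, 2313757)
print(length, protein_length_aa(length))  # 1248 415
```

Under these assumptions the result agrees with the Gene Length and Protein Length values listed in the record.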


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.87614
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACTA CTGCGATAAG CAAAAAAACC CGTCTCCATC CATGGCTGAG ATGCCTGCTA 
TTTCTGACCA CCGCAGGCGG TGTACAAGCA GAAGATTTTA GAGAAACCAC AGGACTTTTC
GATGCCTTAA CCGGCATCAA TATCAATGAA ACAAAATTCA TGCAGTCACT AGGAGTAACG
ATTAATGGCT GGCTAGAAGG CGGCTATACT ATCAATCCAG ACGATCCCCG TGATAACTTC
AATGGACCCG TTACCTTTAA TGACCGTGCC AACGAATTCA TGGGAAACGA AGCCTATTTG
TTCTTTGAAC GCGGCGTGAA TGTCGAGGGC GATCGCTGGG ACTTTGGCGG GCGGGTCGAT
TTTCTTTTTG GTACCGATGC CCGTTTCACC CAGGCAGCGG GCCTAGATGA CAACATCATC
GGTGATGATA CTTTTCGCTT CTACAAATTT GCTATCCCGC AACTATACGT GGAAGCCTAT
GCTCCCTATG GCAACGGCAT CACGGTAAAG CTTGGTCATT TTTATACTAT CATCGGTAAT
GAAGTCGTGA CGGCCCCCGG TAACTTCTTC TATTCCCATG CCTATACGAT GCAGTATGGC
GAACCCTTCA CCCATACTGG TTTTCTAGCC AGTTACCCCT TGACCGATAA TATTAGCATC
AATGGTGGCG GCGTTCTCGG TTGGGACAAT TTTTCCAAAG ATGCTGAAAA TCTTAATTTC
TTAGGCGGGG TAAGCTGGAG CAGTGATGAT GCGCGAACCT CCTTGGCTGT CGCCATCATC
ACGGGCGATG TCTCTGATGT GGGGGGAACC CCAGATGATC CTGATAACAA TCGCACCCTC
TATAGCGTGG TCTTCAACCA CGACTTCACT GATCGGCTTC ACTATACTTT TCAGCACGAT
CTAGGCATAG AACAGCGTGC CATTAATAAC AACAAATCGG CGGAATGGTT TGGCATCAAT
CAATATTTAT TTTATGATAT CAATGAAACT GTAAGCACGG GTTTACGGTT CGAGTGGTTC
CGCGATGACG ACGGCACCCG TGTCTTTGTC AATGATAGCT CCGGTCTCCC GGTTTCCGCC
GCCGCAAATT ATTTTGCCAT CACCGGCGGC TTGAACTGGC GACCATTAAG ATGGGTCACC
GTTCGTCCAG AAGTACGTTA TGACTGGGCC ACCAATTTCG AGGCTTTTGA TAATAACAGC
GATAAGAATC AATTTGTTGT CGCTGCGGAC ATTATCGTTC AATTCTAA
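
The gene sequence is laid out in blocks of 10 bases, 60 per line. A minimal sketch of how a GC fraction could be computed from text in this layout follows. It assumes the spaces and line breaks are layout only, and it is not necessarily how the GC content field of the record was derived; the helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for layout and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, ignoring layout whitespace."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAATACTA CTGCGATAAG CAAAAAAACC CGTCTCCATC CATGGCTGAG ATGCCTGCTA"
print(len(clean_sequence(first_line)))  # 60
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared against the GC content field.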
 
Protein sequence
MNTTAISKKT RLHPWLRCLL FLTTAGGVQA EDFRETTGLF DALTGININE TKFMQSLGVT 
INGWLEGGYT INPDDPRDNF NGPVTFNDRA NEFMGNEAYL FFERGVNVEG DRWDFGGRVD
FLFGTDARFT QAAGLDDNII GDDTFRFYKF AIPQLYVEAY APYGNGITVK LGHFYTIIGN
EVVTAPGNFF YSHAYTMQYG EPFTHTGFLA SYPLTDNISI NGGGVLGWDN FSKDAENLNF
LGGVSWSSDD ARTSLAVAII TGDVSDVGGT PDDPDNNRTL YSVVFNHDFT DRLHYTFQHD
LGIEQRAINN NKSAEWFGIN QYLFYDINET VSTGLRFEWF RDDDGTRVFV NDSSGLPVSA
AANYFAITGG LNWRPLRWVT VRPEVRYDWA TNFEAFDNNS DKNQFVVAAD IIVQF
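
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that relationship using the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in which start codons it permits). The names `CODON_TABLE` and `translate` are illustrative, and the function assumes an unambiguous A/C/G/T sequence read in frame from its first base.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Trailing bases that do not form a full codon are ignored.
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
print(translate("ATGAATACTA CTGCGATAAG"))  # MNTTAI
```

The printed residues match the first six residues of the protein sequence above. A codon containing any character other than A, C, G, or T raises a `KeyError` in this sketch.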