Gene Noc_1856 details

Gene Information

Locus tag: Noc_1856
Symbol:
ID: 3705120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2108821
End bp: 2109933
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 60%
IMG OID: 637738335
Product: hypothetical protein
Protein accession: YP_343852
Protein GI: 77165327
COG category:
COG ID:
TIGRFAM ID:
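
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2108821
END_BP = 2109933


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Length in amino acids, assuming the CDS is a whole number of codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1113
print(protein_length(length_bp))  # 370
```

Both results agree with the Gene Length (1113 bp) and Protein Length (370 aa) fields of the record.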


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCTCG CTCGCCGCCA ACAAATCAGC CTGGAAGAAA CACCCTTCTA CCATTGCATG 
GCCCGTTGTG TGCGTCGGGC CTTTCTCTGT GGGGAGGATT CCCTCACCGG CCAGAGCTTT
GAGCACCGTA AGCAATGGAT CGTGGACAAG CTCAAGGCGC TGGCAGGGAT CTTCGCCATC
GACGTATGTG CCTACGCCGT GATGAGCAAC CATTATCATG TGGTTTTACG GGTAGACCCC
GTGCGGGCTC AAGGTTGGTC CGATGAGGAG GTTATTGACC GTTGGCGGCG GTTGTTTAGC
GGCGGGGTGC TGGTGGAGCG CTTCCTCCAG GGTGAGACCG CCACCCAAGC CGAGCGGGAT
CAAGTCGCCG AGCTTGCCGT CCAGTGGCGC GAGCGCCTGT GGGACATAAG CTGGTTCATG
CGCTGCCTCA ACGAATCAAT CGCCCGTCAG GCCAATCAAG AAGATGGGTG CAAAGGTCGG
TTTTGGGAAG GGCGGTTCAA GAGCCAGGCC TTGCTGGATG AGCGGGCGCT CTTGGCCTGC
ATGGTCTATG TGGATTTGAA TCCCGTGCGT GCAGGAATCG CGGACACGCC CGAAGCCTCG
GACTATACTT CCCTGCAAGC CCGGATTCGG GCTTATGCCG AACAAAGACA GTTGCCGAAC
AATAGCGAGG GCGACACTCG TTCAGGGAAA GATAAACGCC CTCGCCGAGC GGTGTCTCCA
GAGGCCGGTT CTCCGCCCAC GCGGAATAGG CTCAGCGAAC CGTCGGCGGC TTTACTGCCT
TTCCGTGGGA GCGAGCCTGT CGATCAATCC TTGGCAGGAA TCCCGTTGGC GTTTTCCGAC
TACCTTACCT TGACCGATTG GACAGGCCGG GCCATCCGCA ACGACAAGCG TGGTGTTATC
CCTGAAGACG TGCCGCCGAT CTTGAGGCGT TTAGGAATTG ATGAAAATGC CTGGGTTGAG
ACGGTGCGCG ACTATGGGCG GCATTTCTGT CGAGTGGTGG GCCCGGTGGA GCGGCTGCGT
CGGTTGGCGG GAAAATTGGG CCACCGATGG CTGCGGGGCT TGAAGCCGAG TGGGGTGTTA
TACCCGCGGC CTCAAACAGG GTCTCCGAGT TAA
 
Protein sequence
MTLARRQQIS LEETPFYHCM ARCVRRAFLC GEDSLTGQSF EHRKQWIVDK LKALAGIFAI 
DVCAYAVMSN HYHVVLRVDP VRAQGWSDEE VIDRWRRLFS GGVLVERFLQ GETATQAERD
QVAELAVQWR ERLWDISWFM RCLNESIARQ ANQEDGCKGR FWEGRFKSQA LLDERALLAC
MVYVDLNPVR AGIADTPEAS DYTSLQARIR AYAEQRQLPN NSEGDTRSGK DKRPRRAVSP
EAGSPPTRNR LSEPSAALLP FRGSEPVDQS LAGIPLAFSD YLTLTDWTGR AIRNDKRGVI
PEDVPPILRR LGIDENAWVE TVRDYGRHFC RVVGPVERLR RLAGKLGHRW LRGLKPSGVL
YPRPQTGSPS
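
The gene and protein sequences above can be cross-checked with a short sketch like the one below. It assumes the sequences are pasted in the 10-character blocks shown here, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in their permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, chosen for illustration only.

```python
# Codon table in NCBI order (bases T, C, A, G), standard amino acid
# assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last line of the gene sequence in this record.
print(translate("ATGACGCTCG CTCGCCGCCA ACAAATCAGC"))      # MTLARRQQIS
print(translate("TACCCGCGGC CTCAAACAGG GTCTCCGAGT TAA"))  # YPRPQTGSPS
```

The two printed fragments match the first and last ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.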