Gene Noc_1188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1188 
Symbol 
ID3706762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1295348 
End bp1296403 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content53% 
IMG OID637737691 
Productsignal peptide protein 
Protein accessionYP_343220 
Protein GI77164695 
COG category[S] Function unknown 
COG ID[COG4255] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTGTTTC GAACCCCACC TCTGCGTTTA ACGCTTCTTG TGCCAGGACT TGCTCAGGCG 
CTAGAAGCAA GAGCCATAGA GGGGGAGGGG GCGCGGCTAC CTTTTCTGGA ATGCATCATA
GGACAGGCAG ACGTAGAGGC GCTTACCACA CCACTGTATG AAACTTTGCT CTTTGCTTTA
TTTGGCATTT CTCAATCAGG AACGATGGAT GTTCCCTTGG CGCCACTAAT GTATTCCTGG
GATAAGGGGG GTGGCTCCCC TGAGCCGGGG TGGTGGTTAC GGGCAGATCC CGTCTGTTTG
CACCCTGATC GGGATCGGCT CGTGCTTTTT GGACCCTCTC ACCTACAGTT GAGCAGAACT
GAGTCCCAGT CCTTAGCAAA GAGGGTGGCG CCCTTATTTA CTGAATATGG TTGGCAATTT
CAGGCCCTAG AGCCAGATCG CTGGTATCTT CGGTTGCCCC AGCCGGAGCA GGTTACTTTT
ACCGCTTTGA CTGCGCTCGA GGGAAAGTAT ATTGAGCCAG GACTTCCCTC GGGCCCCAAT
AGTTCCCGTT GGCGGACTCT GCTCAATGAA ATCCAGATGC TCCTCCATGA TTGCCCTATC
AATCTCGAGC GAGAGAAGCA GGGGCTTCCG TTGGCGAATA GTGTATGGTT TTGGGGGGCG
GGGGAAGCGC CATCATACCC CATTCCGCCT CTGTGGCGGC AGATTGGTTG GGACCACAAT
CCCCTTCTTC AAGCCTTGGC TGCTTATTGC GAGATTCCAG GCAGACCTGT GCCGGAAGGG
GCTACAGTCT GGTTGGCACA AAATTCAACC ATACCGGGTG ACTATTTGGT GGGATTGGAT
TCTTTATTGC ACACACCAGA CTCTTTCCCT TGTGCTGGAG CTTTGCAAGC ATTGGAAGAG
AACTGGTTTA GCATTTTATA CGCGGCCTTA CGCAACAGAC AGTTAGCCAG CTTAACCTTC
TATCCGATGA ATGGGTATCG CTATCATTTA ACTTGGCAGC GAAGCTGGCG TTTATGGCGG
CGTCCGCGCT CTCTTATAAA AAGCGTGGGG GGTTAA
 
Protein sequence
MLFRTPPLRL TLLVPGLAQA LEARAIEGEG ARLPFLECII GQADVEALTT PLYETLLFAL 
FGISQSGTMD VPLAPLMYSW DKGGGSPEPG WWLRADPVCL HPDRDRLVLF GPSHLQLSRT
ESQSLAKRVA PLFTEYGWQF QALEPDRWYL RLPQPEQVTF TALTALEGKY IEPGLPSGPN
SSRWRTLLNE IQMLLHDCPI NLEREKQGLP LANSVWFWGA GEAPSYPIPP LWRQIGWDHN
PLLQALAAYC EIPGRPVPEG ATVWLAQNST IPGDYLVGLD SLLHTPDSFP CAGALQALEE
NWFSILYAAL RNRQLASLTF YPMNGYRYHL TWQRSWRLWR RPRSLIKSVG G