Gene Noc_2752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2752 
Symbol 
ID3705290 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3123932 
End bp3125284 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content55% 
IMG OID637739230 
Producthypothetical protein 
Protein accessionYP_344731 
Protein GI77166206 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00174638 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTGATAG ATGATTTAAC CCAGGCGCTT AGCGCCAGTG AAGGCTCATC CGCCGAGCAG 
TTGAAACAGC TCAAAGCCCA ACTGCTAGGC CATTGCCGAG CGCCTGTCAA GGTTACGGAT
GAGGACGTTC CCCCTCTGCT TATTGCCGCG ATTAATTTTG CTTACGGCAA TCCCACTTCC
CCAGAGCAGT TGCTAAGCGC GGAAAGCGCC AAGGCCGCGC TGACTGCCAA GGTAACCCCT
AAAACTATTG GGGATATGTT TATTTGGGCC GCCTTGGCTT CCCATATGGG CACCCACCAT
CCCAAGTCCC GCAACGACAA GGTCAGTTCC GGACCCATTG TGGCCCGTTC CTTAACACCG
CCGCCAGCAG GCTTGGTGAC CTCGGCTAAC CTCAGAACGC GCCAATTCTC CGGCTCCGGT
AATGGCGCCA ATACCGCCCA CTATCAATGG TTGGCGTTCA CCTATGAAGA CTCGGCCGGG
GTATTCAGCG TGATTGAGCG CGCCGCTAGC CAGGATGAGC GTCTGGGCGA AACGGTGCTG
ACTTTGGGCA TTCAGGAAAG TCAGTGGCAA GCCTTCCAGG AAGCGGCGGA GGCACATTTA
ACGGTCACCA ATTCCGCGCC CTTGGATCGG CAATTAAAAC AAGTTTTTAT TCCTGATCCA
GGCCAAAATA ACGATTTTAT CGTCATCACG CCCCTTGCCG CCACGGCGGT TATCGGCGCA
TTTGAACAGC ATCGGACTAA ATTGCGGGAA CAAGACGCCA ACCTGAATTT TCACCGGATA
GGTGTTGGTG GTGCTAAGCC GCAAAATGCC GGCAGCCTGA TGAATGAACT GGGCGGTAAT
CTGCGGCAAT TAACCATGAC CATCCCCCGG CTTGATCTCA CCCAACGGCA GCGGCGGCTT
TGGCATTTGC AGCGAGGGCG GCTGTTCCAG CCCCTGCCGA AAAAAGAGGC GCAGCGTTTT
ACATGGTGGC TGGAGCTAGA TTGGACTCAG CGCTACGGTA ATCGGCAAGA CCATCTCCAG
CGCCTTGAAC AGCGGATCGC TGAATGGCTA TTGCCGGAGC TTGAGGTGCA GGAACAGCTT
TTTGCTTGGC TATCATCCGA TTCCCTAGAT GCTGTTCATG AGCGCCAAAA ACTTGAAGCC
GAGAATCTGC CCAACTGGAT TTTAACTTTG GCGGGGATCA GTAAAAATCC GGAAACGGAG
CACGGCACTC ACGAAGCACG GCAAGAAGCG GCCTATAAGG CGCTTACTTT TCATTTGAAA
TCACCCCTGA GCGATGAAAT CGATAAGCTG ATTCAAGACG CTTTGGAAAG CCTGTTGGCT
AGGCGCTACT CGGCGGAGGG GGCAGTAGCA TGA
 
Protein sequence
MLIDDLTQAL SASEGSSAEQ LKQLKAQLLG HCRAPVKVTD EDVPPLLIAA INFAYGNPTS 
PEQLLSAESA KAALTAKVTP KTIGDMFIWA ALASHMGTHH PKSRNDKVSS GPIVARSLTP
PPAGLVTSAN LRTRQFSGSG NGANTAHYQW LAFTYEDSAG VFSVIERAAS QDERLGETVL
TLGIQESQWQ AFQEAAEAHL TVTNSAPLDR QLKQVFIPDP GQNNDFIVIT PLAATAVIGA
FEQHRTKLRE QDANLNFHRI GVGGAKPQNA GSLMNELGGN LRQLTMTIPR LDLTQRQRRL
WHLQRGRLFQ PLPKKEAQRF TWWLELDWTQ RYGNRQDHLQ RLEQRIAEWL LPELEVQEQL
FAWLSSDSLD AVHERQKLEA ENLPNWILTL AGISKNPETE HGTHEARQEA AYKALTFHLK
SPLSDEIDKL IQDALESLLA RRYSAEGAVA