Gene Noc_2754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_2754 
Symbol 
ID3705292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp3126233 
End bp3127306 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content49% 
IMG OID637739232 
Producthypothetical protein 
Protein accessionYP_344733 
Protein GI77166208 
COG category 
COG ID 
TIGRFAM ID[TIGR02566] CRISPR-associated protein, Csy3 family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.180685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACTG TTCAGCTTCC CTCATTACTC AACTATACCC GCAGCATTGT TCCCAGCGAA 
GGCACCTTCT GGGTGGCCAA TGCCGATAAC CGGCCATTGC TTATCCAGGA TAAAACCTTA
TTAGGCACCA TCGCCAACTA CAGCTCAGTC TACAAAAAAG ATAAACAGCG TGATGAAGGC
GCTATTGAAA AGGAAATGAT GGCCGGCGAC AACAATATTC AACGAGTGGA TAGCTGTCAT
CTGCCCGCCG ATGCGGATAC TTTTGAGCTT CGTTTTACGC TTAAATTTTT GGCGAATGCC
AACGGCCCTG AAGCCTGCGA AGGTGCTGAA TTTCGAGAAG ACCTGGAAAG CATAGCGCAA
GCGTACGCTG AAAAAGGCGG TTTTACCCTA CTGGCCGAGC GTTATTTGGC TAATCTTCTA
AATGGGCGGT TTTTATGGCG CAATCGTTAT GGCGTCCAGC GTCAAATCAC CCTGCGTGCG
CCCTATAACG AGCTCAAAGA AAAAACCTTT GAGATTATTG ACCGTGCGGA ACCCACTCTG
CCGCAAGTAC GGATGGATGA GCTCAAACCT TGGATTGATC ATATTGCCAG CGCACTCAGC
GGTAAAACCT CTTTTTTCCT TATGGAAGTC AGCGCTAGGG TAACCATTGG CCTCGGGCAA
GAAGTGTACC CCAGTCAGGA ATTTGTTGAT AAGGATTCGC GGGGGCAAGG GAAAAAGTCC
AAGACCCTAT TTTTTGTGCA AGTCGCCGAT CAACAGGTCG CTGCCATGCA TAGTCAAAAA
ATCGGTAACG CTATCCGTAC CATTGATAAT TGGTATCCGG ACGCCAATGC GGATCGTCCG
CTGGCGGTTG ATCCCTTTAC CGTCGATAAG CGCCGAGCCC GTGCCGTGCG GTTGCCGGAT
CATGGAAAAT CGGACTTCTA TAGTCTTTTA AAAAATTTAC CCGCTCTAAA AGATGATATC
GAGCATGCGC CGAACGCTGA AGCCATACCC GGTCAAGCCC ATTATTTTAT GGCCGTGTTA
ATACGCGGGG GGGTGTTTAG CGGAGAGAAA AAAGCAGAGA AGAAAGCCAA GTAG
 
Protein sequence
MATVQLPSLL NYTRSIVPSE GTFWVANADN RPLLIQDKTL LGTIANYSSV YKKDKQRDEG 
AIEKEMMAGD NNIQRVDSCH LPADADTFEL RFTLKFLANA NGPEACEGAE FREDLESIAQ
AYAEKGGFTL LAERYLANLL NGRFLWRNRY GVQRQITLRA PYNELKEKTF EIIDRAEPTL
PQVRMDELKP WIDHIASALS GKTSFFLMEV SARVTIGLGQ EVYPSQEFVD KDSRGQGKKS
KTLFFVQVAD QQVAAMHSQK IGNAIRTIDN WYPDANADRP LAVDPFTVDK RRARAVRLPD
HGKSDFYSLL KNLPALKDDI EHAPNAEAIP GQAHYFMAVL IRGGVFSGEK KAEKKAK