Gene Noc_0282 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0282 
Symbol 
ID3706453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp310853 
End bp312124 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content56% 
IMG OID637736798 
Productextracellular solute-binding protein 
Protein accessionYP_342342 
Protein GI77163817 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGT GGTTGAAGGT AGGTTTGCTT CACCTTGTCT CCGGGCTTAA AACCGGGATG 
TTGTTACTCT TTTTATTGCT TGCCGCTGCC TGCTCGGATT CTCCTGATAA CCCTGTCCCC
ACCCTAAAAT GGTACGTGTT CGATGAGCCC TCCGGGGCCT TTGAGACGGC GGCAAAACGA
TGCTCGGCGG ACGCCAAGGG TGTCTATCAG GTGGAAATCG CCGCCCTGCC CGCTGATGCC
AGCCAACAGC GGGTACAATT GGTGCGGCGG CTAGCGGCGA AAGACACGGC TATCGACCTC
ATTGGCATGG ATGTTATCTG GACCGCTGAG TTCGCCGAGG CGGGCTGGAT TTTGCCCTGG
CTAGGAGAGG CGGCGGACCA AGCCAGACAA GGGCGCCTAC CCTCCACCAT TGAAAGCGCT
ACCTATGACC ACCAACTTTG GGGCATCCCC TTCACCAGTA ACATTCAATT GCTTTGGTAC
CGCACCGATC AGGTAGCGAA ACCGCCCCAG AGCTGGGATG AATTGATTCA AAGCGCCGAG
GCCCTGGGAA TCGGCACCTT GCAAGTACAA GGGGCACGCT ATGAAGGTTT GACCGTCTTG
TTTAATTCCC TGCTAGCCTC CGCTGGTGGC TCCGTGCTAG ACAAAACCGG CAAGGCGGTT
TCCTTGGAAG CAACGCCCAC TGAAAAAGCT CTCCGTATTA TGAAGCGCAT CGCCACTTCC
CCGGCTACTA ACTCCTCCCT TTCCATTGCC CGGGAAGATG AAACCCGGCT CGCCTTTGAA
GGGGGCAGCG CCTTTATGAT CAACTATACT TATGTTTGGC CCAGCGCCCA GCAGAATGCC
CCCCGGGTCG CCGCCCACAT GGGCTGGGTC CGCTGGCCAG CAGTTATTGA AGGTCAACCT
AGCCGGGTCA CCCTAGGAGG CATTAACTTA AGTGTCGGCG CCTATTCGCG ATATCCCCAG
CTGGCTTTTC GAGCGGCTAC CTGCATCGCT TCCCAGGAAC AGCAACGACT TGCCGCAATT
AAGGGGGGGC TGCTGCCTAC CTTTGAGAAG CTCTATGCGG ATCCCTCGAT TAGAGAAGCC
CTTCCCTTCG CGGACACCCT GCGTGCTACC CTTAAGAATG CGGCCCAACG GCCCTCAAGC
CCCCTCTATA ATGATATTTC CTTGGCCATC AGCCGGACGT TGCACCCCAT GAAAGCCATT
GATCCCCAGA AAGATATTGC TAGGTTGCGA AAAAACATCA ATCGGGCCCT TCACTCCCAG
GGGTTACTAT GA
 
Protein sequence
MKQWLKVGLL HLVSGLKTGM LLLFLLLAAA CSDSPDNPVP TLKWYVFDEP SGAFETAAKR 
CSADAKGVYQ VEIAALPADA SQQRVQLVRR LAAKDTAIDL IGMDVIWTAE FAEAGWILPW
LGEAADQARQ GRLPSTIESA TYDHQLWGIP FTSNIQLLWY RTDQVAKPPQ SWDELIQSAE
ALGIGTLQVQ GARYEGLTVL FNSLLASAGG SVLDKTGKAV SLEATPTEKA LRIMKRIATS
PATNSSLSIA REDETRLAFE GGSAFMINYT YVWPSAQQNA PRVAAHMGWV RWPAVIEGQP
SRVTLGGINL SVGAYSRYPQ LAFRAATCIA SQEQQRLAAI KGGLLPTFEK LYADPSIREA
LPFADTLRAT LKNAAQRPSS PLYNDISLAI SRTLHPMKAI DPQKDIARLR KNINRALHSQ
GLL