Gene Rpic_1042 details


Gene Information

Locus tag: Rpic_1042
Symbol:
ID: 6289579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia pickettii 12J
Kingdom: Bacteria
Replicon accession: NC_010682
Strand:
Start bp: 1108315
End bp: 1109364
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 63%
IMG OID: 642624614
Product: NMT1/THI5 like domain protein
Protein accession: YP_001898622
Protein GI: 187928135
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
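
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def coding_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: one per codon, minus the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def record_is_consistent(start_bp: int, end_bp: int,
                         gene_length_bp: int, protein_length_aa: int) -> bool:
    """True when coordinates, gene length, and protein length agree."""
    return (coding_span(start_bp, end_bp) == gene_length_bp
            and expected_protein_length(gene_length_bp) == protein_length_aa)


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
print(record_is_consistent(1108315, 1109364, 1050, 349))
```

With the values listed for Rpic_1042, the span is 1109364 - 1108315 + 1 = 1050 bp, and 1050 / 3 - 1 = 349 residues, which matches the stated protein length.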


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.778479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGACAC AAGTCATCGG GCGCTGGGCA AAGTGGATGG GCGCTGCACT GTGTGCGACG 
AGCCTGCTGG CTGCCAGTCC AGCCGTTCTG GCGCAGGGCA AACCTGAAAA GAGCAAGGTC
ACCATCGCGG TGGGCGGCAA GGCGTTGTTC TACTACCTGC CGCTGACGAT TGCAGAGCGT
CTGGGTTACT TCAAGGACGA AGGCCTGGAC GTTGAAATCG TCGACTTTGC TGGCGGCGCA
AAAGCCCTGC AGGCCGTGGT GGGCGGTAGC GCCGACGTGG TGAGCGGCGC GTACGAGCAC
ACGCTGGTGC TGCAGGCCAA GGGGCAGATG TACCAGGAGT TCGTGCTGCA AGGGCGGGCC
CCGCAGATCG TGCTGGCGGT CAACAACAAG ACGGTGCCCA ACTACAAGTC GATTGCCGAT
TTGAAAGGCA AGAAGATCGG CGTGACGGCT CCCGGTTCGT CGACCAACAT CATGGTCAAC
TATGTGCTGG CGCGCGCCGG CATCAAGCCG AACGAAGTGT CGATCATCGG CGTGGGTCCG
AGCAGCGGGG CGATTGCCGC CGTGCGTGCT GGGCAGATCG ATGCCCTGGC CAACCTGGAC
CCGGTGATGT CGATGCTCAC GCAAAAGAAC GAAGTGCGCG TCGTGTCCGA TACCCGCACC
CTGGCCGATA CCAAGGCGGT GTTCGGTGGC AACATGCCGG CCGGCTGCCT GTACGCGTCT
ACCGCGTTCA TCCAGAAGAA TCCCAACACG ACGCAGGCGA TGACCAACGC CATGGTGCGT
GCGCTCAAGT GGCTGCAAAA GGCGGGCCCG TCGGACATCG TCAAGACGGT GCCCGAAGCC
TATCTTTTGG GCGACCGTGC GCTGTATCTG GCGGCGTGGG AGAAGGTGCG TGAGGCCATC
TCGCCGGATG GCACGATGCC GGCCGACGGC CCGGCTACGG CGCTGCGCAC GCTGTCGGAG
TTCGATGCGG AAGTGAAGGG CAAGCAGATC AAGCTCGACC AGACCTTCAC CAATACCTTC
GTGCAGAAGG CCAACGCCAA GTACAAGTAA
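
The GC content reported in the gene information can be recomputed from a sequence laid out in 10-base blocks like the one above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the full sequence length; the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Example with only the first line of the gene sequence; paste the full
# sequence to compare against the GC content value in the record.
first_line = "ATGGGGACAC AAGTCATCGG GCGCTGGGCA AAGTGGATGG GCGCTGCACT GTGTGCGACG"
print(f"{gc_percent(first_line):.1f}%")
```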
 
Protein sequence
MGTQVIGRWA KWMGAALCAT SLLAASPAVL AQGKPEKSKV TIAVGGKALF YYLPLTIAER 
LGYFKDEGLD VEIVDFAGGA KALQAVVGGS ADVVSGAYEH TLVLQAKGQM YQEFVLQGRA
PQIVLAVNNK TVPNYKSIAD LKGKKIGVTA PGSSTNIMVN YVLARAGIKP NEVSIIGVGP
SSGAIAAVRA GQIDALANLD PVMSMLTQKN EVRVVSDTRT LADTKAVFGG NMPAGCLYAS
TAFIQKNPNT TQAMTNAMVR ALKWLQKAGP SDIVKTVPEA YLLGDRALYL AAWEKVREAI
SPDGTMPADG PATALRTLSE FDAEVKGKQI KLDQTFTNTF VQKANAKYK