Gene Sala_2489 details

Gene Information

Locus tag: Sala_2489
Symbol:
ID: 4081365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 2627631
End bp: 2630441
Gene Length: 2811 bp
Protein Length: 936 aa
Translation table: 11
GC content: 68%
IMG OID: 638010867
Product: TrwC protein
Protein accession: YP_617529
Protein GI: 103487968
COG category: [L] Replication, recombination and repair
COG ID: [COG0507] ATP-dependent exoDNAse (exonuclease V), alpha subunit - helicase superfamily I member
TIGRFAM ID: [TIGR02686] conjugative relaxase domain, TrwC/TraI family
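
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2627631, 2630441)
print(length)                  # 2811
print(protein_length(length))  # 936
```

Both results agree with the Gene Length and Protein Length fields of the record.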


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.604785
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGCGT CGGTATCGGC GCTGACGAGT TCGGCGCAGG CGAGCAGCTA TTATGAGGCC 
GACGATTATT ATGCCGAGGG CGGGCTGTCG CCGTCCGAAT GGCAAGGCAA GGGCGCCGAG
GAGCTTGGGC TGTCGGGCGA TGTCGATCGC GACCAATTTC GGGAACTGCT CGACGGCAAG
GTCGCGGGCC AGCAACTCGG CACGGTTCGT GACGGCCAGC TTGAGCATCG GCCCGGCTGG
GATGTGACAC TGAGTGCGCC CAAGTCGGTG TCGATCATGG CCGAGGTCGC GGGCGACCGG
CGGTTGATCG AGGCACATGG GCAGGCCGTG AAGACGACGC TCGCGCATAT CGAGGCGCAT
ATGGCCGCGA CCCGCGTTCG GCACGGCGGC AGCGTGACGC GTGAGGCCAC CGGCAATCTC
GTTGTCGCCG GCTTTCAGCA CGGGACCAGC CGGGCGCAGG ACCCGCAGCT TCATACGCAT
AATGTCATCA TGAACGCGAC GCAGGGCGAA GATGGTTCAT GGCGCAGCCT CGAACCGCGC
GCCATCTATC AGCTCCAGAA GCAGATCGGC GCGATCTACC GGCAGGAGCT GGCCTTGAAG
GTCGGCGAGC TCGGCTATGA GATTGCACCG GGCAAGGAGT CGATGTTCGA AATCAAAGGC
GTCTCGGAGG CCGCGATGGC GGCGTTCAGC ACGCGCAGCG CCGAGATTGA AGCGGCGCTA
GGCGAACGCG GAACATCGCG CGAAGAGGCC AGCGCTGCCG AAAAGCAGGT GGCCGCGCTC
GATACGCGGC AAGCGAAGGT GGTGGCCGAC CATGGCGCGC TTGTTGCCGA TTGGCGCGAG
ACCGCCGACC GGGCAGGGTT CGACGCCGAG GCTCGGCTGG CGTTGGTGCG TGAGGCAGAA
GCTCGGGCAG CGAACGGTGT TCAGCTTCCC GATCCATCGG TCGCCGATCG CGCCGTCGCC
CATGCCGCCG ACAAGCTTGG CGAGCGGCAG TCGGTGTTTG CGGTCGCGGC GCTCCATGAG
GAAGCGGGCC GGGTTGGGCT TGGGAAGGTC GGCTATTCCG AGATCGGCGA AGCGATCGGG
CGGGCGACAA AGGAAGGCGA GCTGGTCGAG CGCACCTTCC TCGATCGGCG CGGCGCCGCG
TTCGCGGGGT TCACGACCAG CCAGAATATC GCCGCCGAGA AGACGCTGCT TCGGATCGAA
GCCCGCGGTC GCGGTGCGCT CGCGCCGATA GCCTCCCCGC TTGCCGCCGC CAAGGCTGTC
GCTGGCGCGG CCGCGCAGGC GGAGCGGTCG GGGTTTGGCT GGAATCCCGA CCAGAAGGCC
GCGACCGAAC AGCTCCTTAC CAGCCGCAAT CGGGTCACCG CGGTCCAAGG CTATGCTGGC
ACCGCCAAGA CGACGACGGT GCTCGCCACC TTCGCGCGCG AGGCCGAAGC GCGCGGCGTG
TCGGTGGTCG CGCTGGCGCC GACTGCATCG GCGGCGATGA CACTCGGCGA GGCGCTCGGC
ACGCGCGGCG ATACCGTGGC GCGCCATCTG CTCGCGCCGG AAGATTCGGC GCCCGGGCAG
CCGGTGGCAT GGATCGTCGA TGAGGCTTCG CTCTTGTCGG CGCGCGATAC CGCGCGGCTG
TTCGAGTTGG CCGAGCAGCA TGATGCCCGA ATCATTCTCG TCGGCGACGT GAAGCAGCTT
GGATCGGTCG AGGCTGGCGC GGCGTTCGCG CAGCTTCAGG GCGTCGGCAT GGAAACCGCA
AAGCTCGGCG AGATCGTTCG GCAGAGCAAC GCGGCTACCA AGGAGGCGGT GCTCGCCTCG
ATCGAGGGCG ATGCGAAGAA GGCGCTCGCG GCGCTCGATC GCGGGGGCGG CCAGATCGTC
GAACATGCCG ATCGCTCCGG CCGCTTCGCC GCCATCGCCG ATCGCTATGC CGGGCTCGAC
AAGGCGGCGC GGACGCGAAC GCTGGTCATT GAGCCCTCGC GCGAAGGGCG CGACGCGCTG
ACAGCAGGCA TCCGCACGGC GCTCGTCAAT TCGGGCGCGC TTTCCGGTCC CGCTGTCACG
ATGGAGAGCC TCGTCAACAA GGGGCTCACC CGTGCCGAGG CCCGCGATCC GTTGAGCTAT
GACAGGGGCG ATGTTGTGCG CTTCACCCGC GATTATGCCG ACAAGGGCGT AGCGCGCGGC
GACGCCTATC GTGTCGAGGC GGTCAATCCG GCCAAGGCTG CCATTGCACT GAGGTCCGAG
GATGGGCGCG AGGTCGATTG GCGGCTTCGG CAATGGGGCG CCGGCAAGGT GCAGGTGTTC
GCGCCGCAGA ATATTGACCT CAGGACCGGC GACAGCATCC GCTTCACCCG CAACGATCGC
GACGCCGGGC GGATCAATGG CGCGCGGGGC GAGGTGATCG CGGTGGACGA GCAGGCGCGG
ACGGCGACGG TGCTTGGCGC GCGCGGTCAG GTACAGACTC TCGACCTCGA TGCCGTGCGC
GACCGGCATA TCGCCCACGC TTATGTCGAT ACCGCTTTTG CCGCGCAGGG ACGCACCGCC
GATCATGTCA TAATCCACGC GGACAGCAAG GCGACCAATC TGGTCGACCA GAAAAGCTTC
TATGTCGGCA TCTCGCGCGC AAAGGAGTCG GCGACGATCG TCACCGACGA TCGCGCAAAA
CTGACGTCGG CGATCAATGA GCGCGCCGGG GCCGTCCAGA CCGCGCTCTC ACAGGCACCT
GCCGCCGGGG CCGGCATGGT GCAATCAGCC ATCGCCGCGC CCGCTGCTGA CAAGGCGATC
AGCGCCGCGG TCTCGCAAGC GGCGACGTCG CTGCCCGGCA TGGGGCTTTA G
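
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation such as GC content. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and the value it reports for the full sequence may differ slightly from the rounded figure in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = clean_sequence(seq)
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First row of the gene sequence, copied as displayed (six blocks of ten bases).
first_row = "ATGGTCGCGT CGGTATCGGC GCTGACGAGT TCGGCGCAGG CGAGCAGCTA TTATGAGGCC"
print(len(clean_sequence(first_row)))  # 60
print(gc_percent("ATGC GGCC"))         # 75.0
```

Passing the whole gene sequence to `gc_percent` gives the GC content of the gene.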
 
Protein sequence
MVASVSALTS SAQASSYYEA DDYYAEGGLS PSEWQGKGAE ELGLSGDVDR DQFRELLDGK 
VAGQQLGTVR DGQLEHRPGW DVTLSAPKSV SIMAEVAGDR RLIEAHGQAV KTTLAHIEAH
MAATRVRHGG SVTREATGNL VVAGFQHGTS RAQDPQLHTH NVIMNATQGE DGSWRSLEPR
AIYQLQKQIG AIYRQELALK VGELGYEIAP GKESMFEIKG VSEAAMAAFS TRSAEIEAAL
GERGTSREEA SAAEKQVAAL DTRQAKVVAD HGALVADWRE TADRAGFDAE ARLALVREAE
ARAANGVQLP DPSVADRAVA HAADKLGERQ SVFAVAALHE EAGRVGLGKV GYSEIGEAIG
RATKEGELVE RTFLDRRGAA FAGFTTSQNI AAEKTLLRIE ARGRGALAPI ASPLAAAKAV
AGAAAQAERS GFGWNPDQKA ATEQLLTSRN RVTAVQGYAG TAKTTTVLAT FAREAEARGV
SVVALAPTAS AAMTLGEALG TRGDTVARHL LAPEDSAPGQ PVAWIVDEAS LLSARDTARL
FELAEQHDAR IILVGDVKQL GSVEAGAAFA QLQGVGMETA KLGEIVRQSN AATKEAVLAS
IEGDAKKALA ALDRGGGQIV EHADRSGRFA AIADRYAGLD KAARTRTLVI EPSREGRDAL
TAGIRTALVN SGALSGPAVT MESLVNKGLT RAEARDPLSY DRGDVVRFTR DYADKGVARG
DAYRVEAVNP AKAAIALRSE DGREVDWRLR QWGAGKVQVF APQNIDLRTG DSIRFTRNDR
DAGRINGARG EVIAVDEQAR TATVLGARGQ VQTLDLDAVR DRHIAHAYVD TAFAAQGRTA
DHVIIHADSK ATNLVDQKSF YVGISRAKES ATIVTDDRAK LTSAINERAG AVQTALSQAP
AAGAGMVQSA IAAPAADKAI SAAVSQAATS LPGMGL
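
The protein sequence above corresponds to the gene sequence translated with translation table 11. As a rough illustration, the sketch below builds the codon table by hand rather than using a library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs mainly in the start codons it permits. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence and its final row, as displayed in the record.
print(translate("ATGGTCGCGT CGGTATCGGC GCTGACGAGT"))  # MVASVSALTS
print(translate("AGCGCCGCGG TCTCGCAAGC GGCGACGTCG CTGCCCGGCA TGGGGCTTTA G"))  # SAAVSQAATSLPGMGL
```

The two outputs match the first ten and last sixteen residues of the protein sequence, and the final codon of the gene, TAG, is the stop codon.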