Gene Sala_0201 details

Gene Information

Locus tag: Sala_0201
Symbol:
ID: 4082117
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 215765
End bp: 217171
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 70%
IMG OID: 638008560
Product: argininosuccinate lyase
Protein accession: YP_615258
Protein GI: 103485697
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
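
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions (the record does not say so, but the numbers are consistent with it) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 215765
END_BP = 217171
GENE_LENGTH_BP = 1407
PROTEIN_LENGTH_AA = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))      # 1407
    print(coding_length_aa(GENE_LENGTH_BP))   # 468
```

Both results agree with the Gene Length and Protein Length fields listed in the table.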


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAATA CTCCGGACAA GAGCAGCATG TGGGGCGGCC GCTTCGGTGG CGGACCGGCG 
GCGATCATGC AAGAGATTAA CGCCTCGATC CCCATCGACA AGCGCCTTTG GGAAGAGGAC
ATCGCCGCCA GCCGCGCCCA CGCCGCGATG CTCGGTGCTT GCCGAATCAT CAGTGCCGAC
GATGCGGCAG CGATCGACCG CGGTCTTGCC CAGATCGCCG AAGAATTTGC CGAAAACGGC
GTGCCCGTCG ACCTCAGCCT CGAGGACATC CACATGACCG TCGAGGCGCG GCTGAAGGAG
TTGATCGGCG AACCCGCCGG GCGCCTCCAC ACCGCGCGCT CGCGCAACGA CCAGGTCGCG
ACCGATTTCC GCCTGTGGAC GCGTACCGCC TGCGAGCGCA TCGACGCCGG GCTCGCCGCG
CTCCAGTCGG CGCTGCTTCA GCGCGCCGAC GAGCATGCCG ACAGCATCAT GCCGGGCTTC
ACGCATTTGC AGGTCGCGCA GCCGGTGACG CTCGGCCACC ATCTGCTCGC CTATGTCGAA
ATGGCGCGCC GCGACCGCGG CCGCTTCGCC GATGCGCGCC GCCGCCTCAA CGAATCGCCG
CTCGGCGCCG CGGCGCTCGC GGGGACGGGC TTTCCTGTCG ATCGCGACGC CACCGCTGCG
GCGCTCGGCT TCGACCGGCC GATGGCGAAC AGCATCGACG CGGTATCCGA CCGCGACTTC
GCGCTCGAGT TCTGCGCCGC CGCGGCGATC GCCGCGATCC ACCTGTCGCG CCTTGCCGAA
GAAATCGTCA TCTGGGCCAG CCAGCCCTTC GGCTTCGTCG CGCTGCCCGA TGCCTGGTCG
ACGGGCAGTT CGATCATGCC GCAAAAGCGC AACCCCGACG CCGCCGAACT GGTGCGTGGG
CGCGCGGGCC TGCTGCTCGG CGCCTTCCAG CGGCTCGCCG TCATCGTCAA AGGGCTGCCG
CTCACCTATT CGAAAGACCT TCAGGACGAC AAGGAAACGC TCTTCGGCGC GTTCGACGCG
CTCGCGCTGT CGCTCGCGGC GATGACGGGC ATGGTCGAAA CGCTGAGCTT CCGCACCGAC
CGGATGCGCG CGCTCGCCGC GTCGGGCTAT TCGACCGCGA CCGACCTTGC CGACTGGCTG
GTGCGCGAGG CGGGGCTGCC GTTCCGCGAA GCGCATCATG TCGTCGGCGC CTGCGTCAGG
CGCGCCGAGG AACTGGGCGT CGAGCTGCCC GCGCTGCCCG CCGCCGACGC GGCGGCGATC
CACGCCGCGG TCACCCCCGA TGTCCTCGCC GCACTCACGG TCGAAGCATC GGTCGCCAGC
CGCATGAGCT ATGGCGGGAC CGCGCCCGAA CGGGTAAGAC AGGCCATCGC TGCGGCGCGC
GCTGCCGCGG CCCAGGGACA GGATTGA
 
Protein sequence
MANTPDKSSM WGGRFGGGPA AIMQEINASI PIDKRLWEED IAASRAHAAM LGACRIISAD 
DAAAIDRGLA QIAEEFAENG VPVDLSLEDI HMTVEARLKE LIGEPAGRLH TARSRNDQVA
TDFRLWTRTA CERIDAGLAA LQSALLQRAD EHADSIMPGF THLQVAQPVT LGHHLLAYVE
MARRDRGRFA DARRRLNESP LGAAALAGTG FPVDRDATAA ALGFDRPMAN SIDAVSDRDF
ALEFCAAAAI AAIHLSRLAE EIVIWASQPF GFVALPDAWS TGSSIMPQKR NPDAAELVRG
RAGLLLGAFQ RLAVIVKGLP LTYSKDLQDD KETLFGAFDA LALSLAAMTG MVETLSFRTD
RMRALAASGY STATDLADWL VREAGLPFRE AHHVVGACVR RAEELGVELP ALPAADAAAI
HAAVTPDVLA ALTVEASVAS RMSYGGTAPE RVRQAIAAAR AAAAQGQD
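
The gene and protein sequences above can be cross-checked against each other with a short script. The sketch below is a hedged illustration, not the database's own pipeline: it builds the standard codon table (translation table 11 uses the same amino acid assignments and differs only in which codons may start translation), translates a space-grouped DNA string up to the first stop codon, and computes the GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGGCGAATA CTCCGGACAA GAGCAGCATG"))  # MANTPDKSSM
    # Last 27 bases, ending in the TGA stop codon.
    print(translate("GCTGCCGCGG CCCAGGGACA GGATTGA"))     # AAAAQGQD
```

The two fragments reproduce the first and last residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.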