Gene Sala_2999 details


Gene Information

Locus tag: Sala_2999
Symbol:
ID: 4082943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3141195
End bp: 3142649
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 68%
IMG OID: 638011384
Product: RNA polymerase factor sigma-54
Protein accession: YP_618037
Protein GI: 103488476
COG category: [K] Transcription
COG ID: [COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog
TIGRFAM ID: [TIGR02395] RNA polymerase sigma-54 factor
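
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3141195
end_bp = 3142649
gene_length_bp = 1455
protein_length_aa = 484

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed to be the stop codon.
codons = span // 3
expected_protein_length = codons - 1


def record_is_consistent() -> bool:
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, record_is_consistent())
```

Under these assumptions the span is 1455 bp, which gives 485 codons, or 484 amino acids plus one stop codon.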


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.167711
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGTTGG GTCCGCGCCT CGATCTCCGC CAGTCGCAAT CGCTGGTGAT GACGCCGCAG 
TTGCAGCAGG CGATCAAGCT GCTGGCGCTG TCGAACCTCG AGCTTGAGGC CTATCTGGCC
GAGGCGTTGG AAGGCAATCC GCTGCTCGAC ACGGGCGCAC CCGACAGCGA GGGCGGGGGC
GACGAGCCCG ACGGCGATGC GCCGCCCGCG GCGGAAACGC CTGCCGACAC CGATCGGGCG
CTGGCGTCCG ATGCCGGCAC CGCCGACGAC CTCGACGTCG ATTTCACCGA AGAGCGTTTC
CATCATGACA GCGCGAGCGA CAGCGTCGGC CTGTCGGGTG CGGGCGAGGA CGTGGATTTC
GACAGTTTCG CGGAGGCCGA GGGCTCGCTC CACGACCATC TGCTCGCGCA GGTCGGCGAG
CGGCTCGACG GGATCGAGGC GATCATCGCG GGCCAGCTCG TCGCGCTGAT CGACGAGGCG
GGCTATCTGC GCGCCGACCT TGCCGAACTC GCCGCACAGC TCGGCGTGCC CCTGGCGCTG
GTCGAGGCGG TGCTGCGGGT GGTCCAGGGC TTCGACCCCG CGGGCGTCGG CGCGCGCGAC
CTCGCCGAAT GTATCGCCAT CCAGGCGCGC GAGGCCGACC GCTATGACCC CGCGATGGCG
ACGATGATCG CGCACCTCGA TCTTGTCGCC AAGGGCGCCT TCCCGCAACT GAAGCGCATC
TGCGGCGTCG ATGACGAGGA TCTGGCCGAC ATGATCCGCG AACTGCGCGG CTATGATCCC
AAACCGGGCT TCAGGTTCGG CGGCTCGCCG GTGCAGGCGG TCGTTCCCGA CCTGTATGTC
CGGCGCACCG CGGCGGGCTG GGCGGTCGAG GTGAACAGCG CGACCCTGCC GCGGCTGCTC
GTCAACCGCC GCTATTACAA CGAACTCGCC GCGGGCGCGG CGGCAAAAAG CAAGGCGTGG
CTGTCCGAAC AGCTTGCCGG CGCCAACTGG CTGGTGCGCG CGCTCGACCA GCGCCAGCGC
ACGATCGTCA AGGTGGCGAG CGAGATCGTC AAGCAGCAGG AGGGGTTTTT TCTGCACGGC
GTCGCGCATA TGCGGCCGCT GACCTTGCGC CAGGTCGCCG AGGCGATCGG GATGCACGAA
TCGACCGTCA GCCGCGTCAC CAGCAACAAA TATCTGTCGT GCCCACGCGG CCTGTTCGAG
CTCAAATATT TCTTCTCGTC GGGCATATCG GCGACCGAGG GCGACGGCGC GGTGTCGGCC
GAGGCGGTCA AGAGCCGGAT CAAGGCGTTG ATCGAGGGCG AGGACGCGCG CGCGATCCTG
TCCGACGAGA CGATCGCGCA GAAACTGTCG GCCGAAGGAT TCGACATCGC GCGGCGCACC
GTCGTCAAAT ATCGCGAGGC GATGGGTTAT GGCTCGTCGG TGCAGCGACG GCGGCAGAAA
GCGCTGGCGG GATAG
 
Protein sequence
MALGPRLDLR QSQSLVMTPQ LQQAIKLLAL SNLELEAYLA EALEGNPLLD TGAPDSEGGG 
DEPDGDAPPA AETPADTDRA LASDAGTADD LDVDFTEERF HHDSASDSVG LSGAGEDVDF
DSFAEAEGSL HDHLLAQVGE RLDGIEAIIA GQLVALIDEA GYLRADLAEL AAQLGVPLAL
VEAVLRVVQG FDPAGVGARD LAECIAIQAR EADRYDPAMA TMIAHLDLVA KGAFPQLKRI
CGVDDEDLAD MIRELRGYDP KPGFRFGGSP VQAVVPDLYV RRTAAGWAVE VNSATLPRLL
VNRRYYNELA AGAAAKSKAW LSEQLAGANW LVRALDQRQR TIVKVASEIV KQQEGFFLHG
VAHMRPLTLR QVAEAIGMHE STVSRVTSNK YLSCPRGLFE LKYFFSSGIS ATEGDGAVSA
EAVKSRIKAL IEGEDARAIL SDETIAQKLS AEGFDIARRT VVKYREAMGY GSSVQRRRQK
ALAG
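
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA with the codon assignments of translation table 11. The sketch below is a minimal illustration, not the tool used to produce this record: it uses the standard codon assignments (which table 11 shares for ordinary codons), ignores alternative start codons, and stops at the first stop codon. The names `translate` and `gc_percent` are made up for this example.

```python
from itertools import product

# Codon assignments in the compact NCBI layout: bases ordered T, C, A, G,
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last blocks of the gene sequence listed above.
print(translate("ATGGCGTTGG GTCCGCGCCT CGATCTCCGC"))  # MALGPRLDLR
print(translate("GCGCTGGCGG GATAG"))                  # ALAG
```

Pasting the full gene sequence into `translate` and `gc_percent` allows the listed protein sequence and GC content to be compared against the DNA.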