Gene TM1040_3856 details

Gene Information

Locus tag: TM1040_3856
Symbol:
ID: 4074919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008042
Strand:
Start bp: 107815
End bp: 109062
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 66%
IMG OID: 638004513
Product: RNA polymerase ECF-subfamily sigma factor
Protein accession: YP_611248
Protein GI: 99077989
COG category: [K] Transcription
COG ID: [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain
TIGRFAM ID: [TIGR02937] RNA polymerase sigma factor, sigma-70 family
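
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon; the variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 107815
end_bp = 109062
gene_length_bp = 1248
protein_length_aa = 415

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1


def record_is_consistent():
    """Return True if coordinates, gene length and protein length agree."""
    return (
        span == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, codons, expected_protein_length)  # prints: 1248 416 415
print(record_is_consistent())                 # prints: True
```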


Plasmid Coverage information

Num covering plasmid clones: 56
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACATG CCGTCCCCAG TGCCATCGCC GTGCAGCAGG TCATCGATGA TGTGATGCGC 
CGCGACAGGG GGCGGCTGCT GGCGGGCCTG AGCGTCCGCC TCGGGGATAT CCAGCTGGCA
GAAGACGCTC TGCAGGAGGC GGTGATTGCG GCGCTCAAAC ACTGGGGCCG CTCCGGCGTG
CCTCATGCGC CGCTGGCCTG GCTGATGCGG GCAGGGTTCA ACAAGGGGAT CGACCAGCTG
CGCAGCCGCC AGCGCGAGGG GCGCAAGGCA GAGGATCTGG GGCTGATTGC CCCCGACCAA
GACGCGGCAG AGCAGGCCGA GACCATCCCA GATGCGCGCC TGCGGCTGAT CTTCACCTGC
TGCCATCCGG CGCTGGAGGA AAAATCCCGC GTCGCCCTGA CCCTGCGCAC GGTCTGCAGC
CTGAGCACGC GCGACATTGC CGCGGCCTTT CTCGACAGCG AGCAGACCAT GGGCCAGCGC
CTGTCCCGCG CCAAGGCCAA GATCCGTGCC AAAGGCATCG GGTTTCAGGT GCCAGAGCCC
GACCAGTGGT CCGAACGGCT CGGCACGGTG CTCTCCACGC TCTATCTGAT CTTCACAACC
GGCTATGTGC AGGAGGAGGC AGGCCCGCGC GATTTCTGTC GCGAGGGGAT CTATCTTGCG
CGGCTTTTGT GCGCGCTGCG CCCGGATGAT CCCGAAATCG AGGGGGCGCT GGCCCTGATG
CTGCTGACCG AAGCACGCAG CGCCGCCCGC ATTGGCCCGG ATGGGGCGAT GCGCCCGATC
GAGGACCAGG ACAGCAGCCT CTGGCACCAT GAGACCATCA CCGAGGCGCA GGCTCTGCTG
GCACAGGCGG TTCTGCGCCG CCAGCCCGGC GCGTTTCAGA TCAAGGCCGC CCTTGCGGAT
TGCCACATGA TGCGCCCAAA GCCTGATTGG GCGCAGATGG CGCTGCTCTA CCAGGCGCTC
TGGCGGTTTG AGCCAACCCC GGTGGTGGCA CTCAACCAGG CGGTGGTGAT GGCAGAGCTG
GGGCAAGGCG CTCAGGCCCT GAACCACTTG CGTGCCCTAG AGGATGACCT GGGACAGTTT
CAACCATGGC ATGCGGCAAT GGCACATGTG CTGGCACAGG AAGGCCATAT CGGGGACGCC
CGCTGTGCCT ATGAGCAGGC CATAAAAACC GCTCCCCATG ACGCTGCGCG CCGGTTTCTG
GAAACCAGAG TGCAGAGACT GCCCCCCTGC GCTTTCAGCG GATCTTAA
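
The gene sequence is listed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be computed. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as sample input. Running it over the full listing would let a reader compare the result with the GC content field of the record.

```python
def clean_sequence(text):
    """Join a whitespace-grouped sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, used here as sample input.
first_line = "ATGACACATG CCGTCCCCAG TGCCATCGCC GTGCAGCAGG TCATCGATGA TGTGATGCGC"
seq = clean_sequence(first_line)

print(len(seq))                      # number of bases in the sample line
print(round(gc_fraction(seq) * 100)) # GC percentage of the sample line
```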
 
Protein sequence
MTHAVPSAIA VQQVIDDVMR RDRGRLLAGL SVRLGDIQLA EDALQEAVIA ALKHWGRSGV 
PHAPLAWLMR AGFNKGIDQL RSRQREGRKA EDLGLIAPDQ DAAEQAETIP DARLRLIFTC
CHPALEEKSR VALTLRTVCS LSTRDIAAAF LDSEQTMGQR LSRAKAKIRA KGIGFQVPEP
DQWSERLGTV LSTLYLIFTT GYVQEEAGPR DFCREGIYLA RLLCALRPDD PEIEGALALM
LLTEARSAAR IGPDGAMRPI EDQDSSLWHH ETITEAQALL AQAVLRRQPG AFQIKAALAD
CHMMRPKPDW AQMALLYQAL WRFEPTPVVA LNQAVVMAEL GQGAQALNHL RALEDDLGQF
QPWHAAMAHV LAQEGHIGDA RCAYEQAIKT APHDAARRFL ETRVQRLPPC AFSGS
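
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a rough sketch, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which start codons are permitted), and it assumes the reading frame starts at the first base. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two groups of the gene sequence listing.
print(translate("ATGACACATG CCGTCCCCAG"))  # prints: MTHAVP
```

The printed residues match the first six residues of the protein sequence listing. Passing the complete gene sequence to `translate` should, under the same assumptions, yield the full protein.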