Gene Ent638_3473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3473 
Symbol 
ID5112977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3780777 
End bp3782672 
Gene Length1896 bp 
Protein Length631 aa 
Translation table11 
GC content52% 
IMG OID640493677 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001178183 
Protein GI146313109 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0442977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.143212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGTAG CTGCACCGCC GACACCAATC AAACGAAATA AGTGTGGATA CCGTCTTATG 
GAGCAAAACC CGCAGTCACA GCTGAAACTT CTTGTCCAAC GCGGTAAGGA GCAAGGCTAT
CTGACCTATG CCGAGGTCAA TGACCATCTG CCGGAAGATA TCGTCGATTC AGATCAAATC
GAAGACATCA TCCAAATGAT CAATGACATG GGTATTCAGG TGATGGAAGA AGCACCTGAT
GCCGATGATC TGTTGCTGGC TGAAAACTCC AACAACACCG ATGAAGATGC TGAAGAAGCC
GCTGCACAGG TTCTGTCCAG CGTTGAATCT GAAATCGGTC GTACCACTGA CCCGGTGCGC
ATGTACATGC GCGAAATGGG AACCGTTGAA CTGCTGACCC GCGAAGGCGA AATCGACATC
GCGAAACGCA TCGAAGACGG GATCAACCAG GTTCAGTGTT CTGTTGCCGA GTACCCGGAA
GCGATCACCT ATCTGCTGGA ACAGTACGAT CGCGTAGAAG CAGAAGAGGC GCGTCTTTCC
GACATCATTA CCGGTTTCGT CGATCCTAAC GCTGAAGAAG AAGTCGCTCC GACTGCTACT
CACGTTGGTT CTGAGCTCAC GAAAGAAGAG CGTGAAGAGA ACGAGGAAGA AGACGAAGAA
GACGAAGAAG AAGAAGACGA CAACAGCATC GATCCTGAGC TGGCTCGCGA GAAGTTTGGC
GAACTGCGTA CGCAGTACGA ACTGGCCCGC GACACCATCA AAGCAAAAGG CCGTAGTCAC
GCCGCTGCGA AGGAAGAGAT CCAGAAGCTG TCTGACGTGT TCAAGCAGTT CCGCCTGGTA
CCAAAGCAGT TTGATTACCT GGTCAACAGT ATGCGCGTCA TGATGGATCG CGTGCGAACT
CAGGAACGCA TCATCATGAA ACTGTGCGTT GAACAGTGCA AAATGCCGAA GAAAAACTTC
ATCACACTGT TCACCGGCAA CGAAACCAGC GAAACCTGGT TCAACGCTGC TATCGCCATG
AACAAACCGT GGTCTGAAAA GCTGCACGAC GTCTCTGATG ACGTTCAGCG CGGCTTGCAG
AAACTGCGTC AGATTGAAGA AGAGACCGGC CTGACCATCG AGCAGGTGAA AGACATCAAC
CGTCGTATGT CTATCGGCGA AGCGAAAGCC CGTCGTGCGA AGAAAGAGAT GGTTGAGGCG
AACTTGCGTC TGGTTATTTC TATCGCGAAG AAATACACCA ACCGCGGTCT GCAGTTCCTG
GATCTGATTC AGGAAGGCAA CATCGGTCTG ATGAAAGCGG TAGATAAGTT TGAATACCGT
CGTGGTTATA AATTCTCCAC TTACGCAACC TGGTGGATTC GTCAGGCGAT CACCCGCTCT
ATCGCGGATC AGGCGCGCAC CATCCGTATT CCGGTGCATA TGATTGAGAC GATTAACAAG
CTCAACCGTA TTTCTCGCCA GATGCTGCAA GAGATGGGCC GCGAGCCAAC GCCGGAAGAA
CTGGCTGAAC GCATGTTGAT GCCGGAAGAC AAGATCCGTA AAGTGCTGAA AATCGCGAAA
GAGCCAATCT CCATGGAAAC GCCAATCGGC GACGATGAAG ATTCGCATCT GGGTGATTTC
ATCGAGGATA CTACCCTCGA GCTGCCGCTG GACTCTGCGA CGACCGAGAG CCTGCGTGCT
GCCACTCACG ACGTTCTGGC CGGCCTGACC GCCCGCGAAG CGAAAGTCCT GCGTATGCGT
TTCGGTATCG ACATGAATAC CGACCACACG CTGGAAGAAG TGGGTAAACA GTTCGACGTA
ACCCGCGAAC GTATTCGTCA GATCGAAGCG AAGGCACTGC GCAAACTGCG CCACCCTAGC
CGCTCTGAAG TTCTGCGTAG CTTCCTGGAC GATTAA
 
Protein sequence
MLVAAPPTPI KRNKCGYRLM EQNPQSQLKL LVQRGKEQGY LTYAEVNDHL PEDIVDSDQI 
EDIIQMINDM GIQVMEEAPD ADDLLLAENS NNTDEDAEEA AAQVLSSVES EIGRTTDPVR
MYMREMGTVE LLTREGEIDI AKRIEDGINQ VQCSVAEYPE AITYLLEQYD RVEAEEARLS
DIITGFVDPN AEEEVAPTAT HVGSELTKEE REENEEEDEE DEEEEDDNSI DPELAREKFG
ELRTQYELAR DTIKAKGRSH AAAKEEIQKL SDVFKQFRLV PKQFDYLVNS MRVMMDRVRT
QERIIMKLCV EQCKMPKKNF ITLFTGNETS ETWFNAAIAM NKPWSEKLHD VSDDVQRGLQ
KLRQIEEETG LTIEQVKDIN RRMSIGEAKA RRAKKEMVEA NLRLVISIAK KYTNRGLQFL
DLIQEGNIGL MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI PVHMIETINK
LNRISRQMLQ EMGREPTPEE LAERMLMPED KIRKVLKIAK EPISMETPIG DDEDSHLGDF
IEDTTLELPL DSATTESLRA ATHDVLAGLT AREAKVLRMR FGIDMNTDHT LEEVGKQFDV
TRERIRQIEA KALRKLRHPS RSEVLRSFLD D