Gene Arth_3349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3349 
Symbol 
ID4444078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3764719 
End bp3766029 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content70% 
IMG OID639691172 
ProductRNA polymerase ECF-subfamily sigma factor 
Protein accessionYP_832824 
Protein GI116671891 
COG category[K] Transcription 
COG ID[COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain 
TIGRFAM ID[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.220451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGGC GCGGTTCCGC TGCCGGGGGT TCCGCCTCGG GGAATTCCAG CAGCGAGATA 
GCCCGGATCT TCCGCCGGGA GTACGGCCGC GCCGTGGCAG TGCTGGTCCG GCTCTTCGGC
AGCATCGACC TCGCCGAGGA CGCCGTGCAG GACGCGTTCA CGGCGGCGGT GCAACGCTGG
CCCTCCAGCG GCGTCCCGCC CAGCCCGGCC GGATGGATTA TCACCACGGC CCGCAACAGG
GCAGTCGACC GGCTCCGGCG TGACGCCGCC CGCGACGACA AATACGCCAG GGCCGCCCTG
CTGCATGCCC GCGCCGGAGA CGCCTCCGGC GCGGCGCCCG AGGATCTGCT GATGGATGAG
CTGGAAGAGG AGGCCGGGGT GCGCGATGAC ACGCTGCGGC TGATCTTCAC CTGCTGCCAC
CCGGCCCTGG GAACCCCGGC CCGCGTGGCG CTGACGCTAC GCCTCCTGGG CGGGCTGAGC
ACCGCGGAGA TAGCCCGCGC CTTCATGGTG CCGGAAAAGA CCATGGCCCA GCGGCTGGTC
CGGGCCAAGG CGAAAATACG GGACGCCCGG ATTCCCTACC GCGTGCCCCA CGGTTCCGAG
CTGCCGGAGA GACTGACGGC CGTTCTCGCT GTGGTCTACC TCATCTTCAA TGAGGGCTAC
AGCGCAAGCT CCGGCGACGC ACTGGTCCGG GTCGAGCTCT GCGGGGAGGC CGTCAGACTG
GCCCGGCTGC TGGTGGCCCT GATGCCGGAT GAACCCGAAG CCCAGGGGCT TCTTGGGCTG
CTGCTGCTGG TGGAGTCGCG GCGCGCAGCC AGGATGGCAC CCGACGGCGG CATGGTGCTG
TTGGCGGACC AGGACCGGCA GCTGTGGGAC AAGGACCTGA TCCTTGAGGG GCAGGCCCTT
GTGCGCCGGT GCCTTCGCCG GAACCGGCCG GGACCGTACC AACTTCAGGC CGCCATCAAC
GCTGTGCACA GTGATTCCCC GTCAGCCAGC GAAACGGACT GGGAGCAGAT CCTACAGTTG
TACGATCAGC TCCTGCAGGC GTCGCCGGGT CCGGTGGTGG CACTCAACCG CGCGGTGGCC
GTTGCCGAAG TGCACGGCCC TGAGGCAGCC CTCGGCCTGG TCGACGCCCT GGAACTGGCA
GGCTACGGGG TGTTCCACTC CGTGCGCGCG GATCTCCTCC GGCGCCTGGG CCGCTTTTCC
GAAGCCAGGG AGGAATACCG CGACGCACTG GGGCTGGCAG GCAACGCGGC CGAGAGGCGG
TTCCTGGAAG GCCGGCTGCT TGGGCTGCCC GCGGCGGACC GGCCGAGTTA A
 
Protein sequence
MTGRGSAAGG SASGNSSSEI ARIFRREYGR AVAVLVRLFG SIDLAEDAVQ DAFTAAVQRW 
PSSGVPPSPA GWIITTARNR AVDRLRRDAA RDDKYARAAL LHARAGDASG AAPEDLLMDE
LEEEAGVRDD TLRLIFTCCH PALGTPARVA LTLRLLGGLS TAEIARAFMV PEKTMAQRLV
RAKAKIRDAR IPYRVPHGSE LPERLTAVLA VVYLIFNEGY SASSGDALVR VELCGEAVRL
ARLLVALMPD EPEAQGLLGL LLLVESRRAA RMAPDGGMVL LADQDRQLWD KDLILEGQAL
VRRCLRRNRP GPYQLQAAIN AVHSDSPSAS ETDWEQILQL YDQLLQASPG PVVALNRAVA
VAEVHGPEAA LGLVDALELA GYGVFHSVRA DLLRRLGRFS EAREEYRDAL GLAGNAAERR
FLEGRLLGLP AADRPS