Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3349 |
Symbol | |
ID | 4444078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 3764719 |
End bp | 3766029 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639691172 |
Product | RNA polymerase ECF-subfamily sigma factor |
Protein accession | YP_832824 |
Protein GI | 116671891 |
COG category | [K] Transcription |
COG ID | [COG4941] Predicted RNA polymerase sigma factor containing a TPR repeat domain |
TIGRFAM ID | [TIGR02937] RNA polymerase sigma factor, sigma-70 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.220451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGGC GCGGTTCCGC TGCCGGGGGT TCCGCCTCGG GGAATTCCAG CAGCGAGATA GCCCGGATCT TCCGCCGGGA GTACGGCCGC GCCGTGGCAG TGCTGGTCCG GCTCTTCGGC AGCATCGACC TCGCCGAGGA CGCCGTGCAG GACGCGTTCA CGGCGGCGGT GCAACGCTGG CCCTCCAGCG GCGTCCCGCC CAGCCCGGCC GGATGGATTA TCACCACGGC CCGCAACAGG GCAGTCGACC GGCTCCGGCG TGACGCCGCC CGCGACGACA AATACGCCAG GGCCGCCCTG CTGCATGCCC GCGCCGGAGA CGCCTCCGGC GCGGCGCCCG AGGATCTGCT GATGGATGAG CTGGAAGAGG AGGCCGGGGT GCGCGATGAC ACGCTGCGGC TGATCTTCAC CTGCTGCCAC CCGGCCCTGG GAACCCCGGC CCGCGTGGCG CTGACGCTAC GCCTCCTGGG CGGGCTGAGC ACCGCGGAGA TAGCCCGCGC CTTCATGGTG CCGGAAAAGA CCATGGCCCA GCGGCTGGTC CGGGCCAAGG CGAAAATACG GGACGCCCGG ATTCCCTACC GCGTGCCCCA CGGTTCCGAG CTGCCGGAGA GACTGACGGC CGTTCTCGCT GTGGTCTACC TCATCTTCAA TGAGGGCTAC AGCGCAAGCT CCGGCGACGC ACTGGTCCGG GTCGAGCTCT GCGGGGAGGC CGTCAGACTG GCCCGGCTGC TGGTGGCCCT GATGCCGGAT GAACCCGAAG CCCAGGGGCT TCTTGGGCTG CTGCTGCTGG TGGAGTCGCG GCGCGCAGCC AGGATGGCAC CCGACGGCGG CATGGTGCTG TTGGCGGACC AGGACCGGCA GCTGTGGGAC AAGGACCTGA TCCTTGAGGG GCAGGCCCTT GTGCGCCGGT GCCTTCGCCG GAACCGGCCG GGACCGTACC AACTTCAGGC CGCCATCAAC GCTGTGCACA GTGATTCCCC GTCAGCCAGC GAAACGGACT GGGAGCAGAT CCTACAGTTG TACGATCAGC TCCTGCAGGC GTCGCCGGGT CCGGTGGTGG CACTCAACCG CGCGGTGGCC GTTGCCGAAG TGCACGGCCC TGAGGCAGCC CTCGGCCTGG TCGACGCCCT GGAACTGGCA GGCTACGGGG TGTTCCACTC CGTGCGCGCG GATCTCCTCC GGCGCCTGGG CCGCTTTTCC GAAGCCAGGG AGGAATACCG CGACGCACTG GGGCTGGCAG GCAACGCGGC CGAGAGGCGG TTCCTGGAAG GCCGGCTGCT TGGGCTGCCC GCGGCGGACC GGCCGAGTTA A
|
Protein sequence | MTGRGSAAGG SASGNSSSEI ARIFRREYGR AVAVLVRLFG SIDLAEDAVQ DAFTAAVQRW PSSGVPPSPA GWIITTARNR AVDRLRRDAA RDDKYARAAL LHARAGDASG AAPEDLLMDE LEEEAGVRDD TLRLIFTCCH PALGTPARVA LTLRLLGGLS TAEIARAFMV PEKTMAQRLV RAKAKIRDAR IPYRVPHGSE LPERLTAVLA VVYLIFNEGY SASSGDALVR VELCGEAVRL ARLLVALMPD EPEAQGLLGL LLLVESRRAA RMAPDGGMVL LADQDRQLWD KDLILEGQAL VRRCLRRNRP GPYQLQAAIN AVHSDSPSAS ETDWEQILQL YDQLLQASPG PVVALNRAVA VAEVHGPEAA LGLVDALELA GYGVFHSVRA DLLRRLGRFS EAREEYRDAL GLAGNAAERR FLEGRLLGLP AADRPS
|
| |