Gene Arth_2228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2228 
Symbol 
ID4445289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2506390 
End bp2507955 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content65% 
IMG OID639690037 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_831708 
Protein GI116670775 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000756142 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTGCGAC GCCGAGACAC TCCCGGACAA GGCATTGGTT CGCCAGCGTC CGACGCCGCT 
GCCAGGCATT CGGACACGAA GGACATGGGC CCCGCCCGGC ACCTCGGCGC CATGGGCGGA
AAGCCCGCGT GGTTCAAGGT GGCCACTGCT GTTGTGGCCC TTGTCCTGGT GGGCGCTCTC
GCTTTCGCCG CCTTCTGGGT CATCCGCTTG CAGATGAACA TCAGCAAAGC TCCACTCGGC
GCAGGCAGCA GCCGCACCGA AGATCCCGTC AACGACTCCA AGGACCGGAT GCAGATCCTG
ATCCTGGGCT CGGACACCCG CGACGGCAAG AATTCCGACT ACGGCACGGC CGAGGACTCC
ACAGGCTACG GCCAGTCGGA CGTCATGATG ATGATGGACA TCTCGGCGGA CAACAAGCGT
GTCAGCGTCA TCAGCTTCCC GCGCGACCTG CTCGTGGACA TTCCCGAATG CACGGACCAG
AAGACCAAGC AGGTGTTCCC GGCCCGAAGC GGCGTGATGA TCAACGAAGC CATGAAAGAG
GCCGGCATCG GCTGCGCCGT GGACACGGTG AACAAAATCA CGGGGTTGGA AATCGACCAC
TTCATGATGG CGGACTTCAA CGCGGTCAAG GAACTTTCCA ACGCGGTGGG CGGCGTGGAA
GTCTGCGTAA GTGACGCCGT CTACGACCCC GACTCCCGCC TGCGCCTCCC CGCAGGAAAC
TCGCAGGTGC AGGGCGAGCA GGCGCTGGCC TACCTGCGGA CCAGGCATGC CTTCGCGGAC
GGCGGTGACC TGGGCCGCAT CAAGGCGCAG CAGGGCTTCC TGTCATCCCT CACCCGCAAG
ATCAAGGATG ACGGCACACT GTCCGACCCC CAGAAGATGC TCAAGATTGC CGACGTCGTC
ACGCAGAACC TTACGGTGGA TGATGGACTG GCGTCCGTCC CGTCGCTGCT GACCATCGGC
AACCGGCTCA AGAACATTGA CATCAGCAAG GTGGCGTTCG TTGCCGTGCC AACCACGCCT
GCTCCCACTG ATCCCAACCG GCTCACCGTT GCCGAGCCGG CCGCATCGCA GCTTTTCGCC
GCGCTGCGCA AGGACGTCGA CCTGACCGAC CCGACAGCCA CCCCGAGCCC CACGGCGGAG
CCGAGCGAAT CGGCTCCCGC CCCGACGCCG ACCGAAACGC CGCTGCCGCC CTACGATAAG
GCGCTGCAGC CGGTGACCGT CGCGAACGGA ACGGGTGTTC CGGCGCGGAC CCAGGAGATC
ACCCAGGCGA TCATCGCCGG CGGCTTCACC CAGGTGGCCC CGCTTGTGGC GCAGCCTGTC
GCGAAGACGG CGGTCTACTA CGGACCCGGC TTCGAGGACG TGGCGGCGGA CGTCGCAGCA
TTGCTGGAAA TACCCGCCAC GCAGGTTCTC CCGGCGGCCG GCGTCAGCGG AGTTCAGGTC
TACCTCGGCA CCGACTTCAT GTCCGGAACA AAGATGGACT CCGTGCCGCT CCCGTCCGAC
ATTGTCAACC AAACGGCCGG CGACACCGTC TGCCAGCAGG CGAACCCTGA ACTGATCGTC
CGCTAG
 
Protein sequence
MVRRRDTPGQ GIGSPASDAA ARHSDTKDMG PARHLGAMGG KPAWFKVATA VVALVLVGAL 
AFAAFWVIRL QMNISKAPLG AGSSRTEDPV NDSKDRMQIL ILGSDTRDGK NSDYGTAEDS
TGYGQSDVMM MMDISADNKR VSVISFPRDL LVDIPECTDQ KTKQVFPARS GVMINEAMKE
AGIGCAVDTV NKITGLEIDH FMMADFNAVK ELSNAVGGVE VCVSDAVYDP DSRLRLPAGN
SQVQGEQALA YLRTRHAFAD GGDLGRIKAQ QGFLSSLTRK IKDDGTLSDP QKMLKIADVV
TQNLTVDDGL ASVPSLLTIG NRLKNIDISK VAFVAVPTTP APTDPNRLTV AEPAASQLFA
ALRKDVDLTD PTATPSPTAE PSESAPAPTP TETPLPPYDK ALQPVTVANG TGVPARTQEI
TQAIIAGGFT QVAPLVAQPV AKTAVYYGPG FEDVAADVAA LLEIPATQVL PAAGVSGVQV
YLGTDFMSGT KMDSVPLPSD IVNQTAGDTV CQQANPELIV R