Gene Arth_3221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3221 
Symbol 
ID4444211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3627591 
End bp3629180 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content68% 
IMG OID639691045 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_832697 
Protein GI116671764 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0489] ATPases involved in chromosome partitioning
[COG3944] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.929706 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTCA ACATTGCGGC CGGCGCAGAA GCACCTGCCG GCCTGGACCT GGCAGACTAC 
CTTCGGGTGG TGCGGGTGTA CTGGAAGGCC ATCGTCGCCT TCACCCTGCT GGCCACATTG
ACGGCGTTCG GCTGGACCGT CCTTCAGCCC AAGATCTACT CGTCCGATTC CAGCGGCATT
GTGGTCACAC CGGGCTCGGA CAACGTGAGC CTTTCACTGG CCGGGGACAG CCTGGCCAAG
GCAAAGGTCA AGAACTACGA GTCCGTGGCC AAGTCCCGCC TCGTGGCTGA CCGGGTCATC
GCCTCCCTGG AACTTAAAAC AACCGCAGAC GCGCTACTCG GCACCATCAG CGTCAAGGTT
CCGCTGGATA CCGCCGAGAT CCGGGTGACG GCCCAGTCGC CCGATCCGGC AACCGCCCAG
CGCGTGGCCG ATGCCTGGGT CAACGGCCTC GCCGCCCAGG TGGAAGCGAT CGAAACAGCC
ACACCCGGAA CAGCAACGCC CGACGCCGGC ACCTCGCCGG ACGCTGCCAC CGCACCAGCG
GCGACGGCCT CCGCCGTCAG GATCCTCCCG CTGGGCAAAG CGGTCCTTCC CACCAGCCCG
GTGTCCCCGA ACGTCAAGCT CACCCTGGCG CTCGGTGCAC TCATCGGTCT GGCCCTCGGC
GTGGCCTATG CCCTGGTCCG CCGGCACCTG GACCGGCGCA TCCGCAACGC CACCGAAATC
GAACGGCTCT TCGACGTCCC GGTGATCGGC ACCCTCCCCG TGGACCACCG CCTGGACGAG
AAAAGCACCA TCCTGGACGC TGGAGCCGCA GCCCAGCTGC ACGACGCCGG CGGGGCGATG
GCCGAGGCCC TCCGCGAACT GCGCACCAAC CTCAGCTTCC TGGACGTGGA CCAGCCGCCG
CGGATCATCG TGGTCACCAG CTCCATGCAG GCCGAAGGAA AGTCCACCGT CACCGCCAAC
CTGGCGGTCA CCATGGCGGC CGCCGGCGAG AACGTCGTAG TGGTCGACGG CGACCTCCGC
CGCCCCACGC TGGTGGACGT TTTCAACCTG GTTCCGGGAG TCGGGGTTAC CGACGTGCTC
ACCGGCACCG CTGAACTGGA GGATGTCCTC CAGCCCTGGG GTGCCCTGCC GAACCTCTCG
GTCCTCGGTT CCGGCCGCAT TCCGCCGAAC CCCAGCGAAC TGCTGGGCTC CAAGGCCATG
AAGAACATGC TCAACGCCCT GGCAGAGAAC GCAATCGTGC TGATCGACGC CCCGCCGCTG
CTGCCGGTCA CGGATGCTGC GGTACTCTCC CGCGTGGCGG ACGGCGCCAT CGTGGTGATC
CGGACGGGCC GGACCACCCA GGAGCAACTG GGCCAGTCCC TGGGCAACCT GGAAAAGGTG
AAGGGCCGCA TCCTGGGCGC CGTCCTGAAC TACGTGCCCA CCAAGGGCAC GGACGCCTAC
TCCTACTACG GGACGTACAC CTCGGCTCCT GAAACGCAGG ACCTCCCGGA GCTCGCCCAT
CCCGACGCCG TGGCGCACGA ACCGCAGTGG GACACCGAAC ACGACGACGT CCTGGAACCC
GCCGCGGCCG GCCGCCGCGC ACGGGCCTAG
 
Protein sequence
MSVNIAAGAE APAGLDLADY LRVVRVYWKA IVAFTLLATL TAFGWTVLQP KIYSSDSSGI 
VVTPGSDNVS LSLAGDSLAK AKVKNYESVA KSRLVADRVI ASLELKTTAD ALLGTISVKV
PLDTAEIRVT AQSPDPATAQ RVADAWVNGL AAQVEAIETA TPGTATPDAG TSPDAATAPA
ATASAVRILP LGKAVLPTSP VSPNVKLTLA LGALIGLALG VAYALVRRHL DRRIRNATEI
ERLFDVPVIG TLPVDHRLDE KSTILDAGAA AQLHDAGGAM AEALRELRTN LSFLDVDQPP
RIIVVTSSMQ AEGKSTVTAN LAVTMAAAGE NVVVVDGDLR RPTLVDVFNL VPGVGVTDVL
TGTAELEDVL QPWGALPNLS VLGSGRIPPN PSELLGSKAM KNMLNALAEN AIVLIDAPPL
LPVTDAAVLS RVADGAIVVI RTGRTTQEQL GQSLGNLEKV KGRILGAVLN YVPTKGTDAY
SYYGTYTSAP ETQDLPELAH PDAVAHEPQW DTEHDDVLEP AAAGRRARA