Gene Arth_3195 details


Gene Information

Locus tag: Arth_3195
Symbol:
ID: 4444185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3600394
End bp: 3601905
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 57%
IMG OID: 639691021
Product: lipopolysaccharide biosynthesis
Protein accession: YP_832673
Protein GI: 116671740
COG category: [D] Cell cycle control, cell division, chromosome partitioning
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0489] ATPases involved in chromosome partitioning
        [COG3944] Capsular polysaccharide biosynthesis protein
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
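
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (the record does not state this) and that the CDS ends in a single stop codon. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3600394
END_BP = 3601905
GENE_LENGTH_BP = 1512
PROTEIN_LENGTH_AA = 503


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


print(span_length(START_BP, END_BP))   # 1512
print(protein_length(GENE_LENGTH_BP))  # 503
```

Under those assumptions both computed values agree with the Gene Length and Protein Length reported in the record.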


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGATCTAC ACGACAGCTT GCGGATTTTG CGCCGTAATT GGATTCTCAT CGTGGCCGCT 
ACCCTTGCTG GCCTCCTCAT GAGCGGTGGC GCATCCGTCC TCACCAAGCC GACGTATCGG
GCTGACACTC AGTTATTTGT GGCAATTCAG AGTTCTGGTT CGGTCCAGGA GCTCCAGCAG
GGCAACACCT TCAGCCAGGC CAGAGTCCAG TCATACGTAA AGACGGTGGC GTCGCCCATA
GTATTGCAGC CAGTTATCGA TACGCTCGGC CTCCCCAGCA CGGCTGAGGA ATTAGCAGGG
CGAGTGAAGG CCAGCACGGA CCTAAACACT GTGTTGATAA ACATCTCCGT CGAGGACAAC
TCCCCAGTTC AGGCGGCCGC AACAGCTCAA GCCGTTGCCG ATAGCCTTAT CAGGGTCGTT
GACACGTTAG AAAAGCCGAA AACGGGAGGA ACCTCGCCGG TCAGCCTTTC GATCACAAAA
CCTGCAAAAG CGCCTTCTGT GCCATCAGCT CCTGATACGC GTCTTAACCT AATGGTTGGC
CTCCTTTTGG GCCTTGCCGC CGGGGTGGGC GCCGCCCTAC TCCGGACGTC ACTCGACAAC
AGAATTAGAG GGGAGGCTGA CTTGCGACGA GTCACTGAGT TGCCGCTCCT CGGTGGGATT
TCATTTGACC AGGATGCGAC GCGCAAGCCT TTGCTCACCC AAACTGCTGC GCAGAGTCCC
CGATCAGAGT CGTTCAGGCA ATTGCGGACC AATCTCCAAT TTGCAAATGT TTCAGGACGT
GCCAGCACCG TAGTCGTAAC TTCTTCGCTC CCGGGGGAGG GCAAGAGCAC GACTGCAACA
AACTTGGCCA TTTCCCTTGC GCAGGCTGGC CAGAAGGTTT GCCTTATCGA CGCAGACTTG
CGTCGTCCTA TGATCAACGA ATATCTCGGA CTCGACCGAA GCGCCGGGTT GACTACAGCT
CTGGTCGGGT TGGCCGACGT CAGTGATCTC CTCCAGCCGT GGGGTGATGA CAGTCTCTAC
GTACTGGCTT CGGGACAGAT ACCACCAAAC CCAAGTGAAC TCCTCGGGTC GGATGAGATG
AAACAACTAA TCACGCGGTT AGAAGACGCT TTCGATACGG TCGTAATTGA CGCACCTCCC
CTTCTACCCG TCACAGACGC TGCAGTCTTG TCTCAACACG TAGGCGGAGT TGTCGTCGTC
GTGGGATCTC AGAAGCTTCG CCAAAACGAT CTGGAGAAAT CCCTCGGCGC CCTGAACATG
GTCGGCGCTA ACGTTTTGGG AATCGTCCTG AACCGACTTC CGGTGAAGGG ACCCGACTCA
TATGCCTACA CCTACTACAG CCACGACGGT GCGACAGCCA CCGCAAAGAG GCCCAAGGCC
AACGCTCAAG AGCGCGGACA ACAAACAAAT GCTGGCGAGC CAGACCTTCA CTACGGTGAC
TTCGATCGGC AAATTCTGGA AACGCAATCT CAGCCCGCAC ACGTGTTTCC TCGGACTAGC
GCGGAGCGGT AG
 
Protein sequence
MDLHDSLRIL RRNWILIVAA TLAGLLMSGG ASVLTKPTYR ADTQLFVAIQ SSGSVQELQQ 
GNTFSQARVQ SYVKTVASPI VLQPVIDTLG LPSTAEELAG RVKASTDLNT VLINISVEDN
SPVQAAATAQ AVADSLIRVV DTLEKPKTGG TSPVSLSITK PAKAPSVPSA PDTRLNLMVG
LLLGLAAGVG AALLRTSLDN RIRGEADLRR VTELPLLGGI SFDQDATRKP LLTQTAAQSP
RSESFRQLRT NLQFANVSGR ASTVVVTSSL PGEGKSTTAT NLAISLAQAG QKVCLIDADL
RRPMINEYLG LDRSAGLTTA LVGLADVSDL LQPWGDDSLY VLASGQIPPN PSELLGSDEM
KQLITRLEDA FDTVVIDAPP LLPVTDAAVL SQHVGGVVVV VGSQKLRQND LEKSLGALNM
VGANVLGIVL NRLPVKGPDS YAYTYYSHDG ATATAKRPKA NAQERGQQTN AGEPDLHYGD
FDRQILETQS QPAHVFPRTS AER
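
The gene sequence begins with TTG while the protein sequence begins with M. This is consistent with translation table 11, where TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline that produced this record: `translate` and its `is_cds_start` parameter are hypothetical names, and only the first 30 and last 12 bases of the gene sequence are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 assigns the same
# amino acids as the standard code; it differs in its initiation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate DNA in frame 0; '*' marks a stop codon.

    If is_cds_start is true and the first codon is an initiation codon,
    it is emitted as 'M' regardless of its usual meaning.
    """
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TABLE[codon])
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("TTGGATCTAC ACGACAGCTT GCGGATTTTG"))    # MDLHDSLRIL
# Last 12 bases, which are in frame because 1512 is a multiple of 3.
print(translate("GCGGAGCGGT AG", is_cds_start=False))    # AER*
```

The first call reproduces the opening ten residues of the protein sequence, and the second reproduces its final three residues followed by the stop codon TAG.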