Gene Arth_2230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2230 
Symbol 
ID4445291 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2509047 
End bp2510390 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content64% 
IMG OID639690039 
ProductCBS domain-containing protein 
Protein accessionYP_831710 
Protein GI116670777 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00904717 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCCCAC TGATCCTTGT TGGCATGGCG CTGGTCTTCC TCAGTTTCGC AGCCCTGCTG 
ACAGCGGCTG AGTCCGCGTT TAATTTCCTT CCCCGCCACG ACGCCGAAGA GGCCGTGCTT
CGAAGTGACG GCCAGGCCCT GAAGCGCATC CTGGACCAGC CAGTCGCGCA CATGCGGTCC
CTGAGGTTTT GGCGGATCTG GTTCGAAATG GCCTCGGCAG TGGCGGTCGC TGTCCTTCTG
CACAGCCTGC TGGACAGTGT ATGGCTCGCC GGCCTTGCCG CCACCGGAAT CATGGCCCTC
GTAGGCTTCG TGATCGTGGG GGTGTCACCG CGGCAGCTGG GCCGTGCCCA CGCCGCCGGC
GTCGCCAGGT TCAGTGCGCC GATCATCCGT TTCCTATGCT GGGTGCTGGG TCCCATTCCG
GGCTGGCTGG TTGCGCTGGG CAGCGTCGTG GCACCGGGCG CACGCGCCGG CGACGAAGCC
TCCTTCAGTG AAGAAGAGTT CCGCGAGCTC GTGGACCGTG CCACTGAATC TGACATGATC
GAGGACAACG AAGCCGAACT GATCCAGTCC GTGTTCGACT TCGGCGACAC CCTGGTCCGG
GCCGTGATGG TGCCCCGCAC GGACATCCTC AGCATCGACG CGGGCTCGAG CCTGCACCGG
GCCATGTCCC TCTTCCTGCG GTCCGGCTAC TCCCGGATCC CCGTGATCCG CGACAACACG
GACCAGATCC TGGGCATCAT CTACCTCAAG GATGTCGCCG CCGCGCTGCA CGGCCTCGGC
CCGGGCGAGG AACCCCCCAT CGTGGATGAC CTTGCCCGCG AAGTCCGCTA CGTGCCGGAG
TCGAAGCAGG TCAGTGACCT GCTTCGTGAA CTGCAAAAGG AATCAACGCA TGTGGCCATC
GTGATCGACG AGTACGGCGG AACTGCCGGA CTTGTGACGC TTGAGGATCT GATCGAGGAA
ATCGTCGGCG AGATTGTGGA TGAATATGAC ACCGAGAGCG CCGAGGCCGT GGCCCTTGGC
AACGGCAGCT ACCGGGTGAG TGCCCGGATG GGCATCGACG ACCTCGGCGA GCTGTTTGAT
GTGGAACTCG ACGACGACGA AGTGGACACC GTCGGCGGCC TGCTCGCCAA GGCCCTCGGC
CGGGTTCCCA TCGTCGGCAG CACCGTAGAG GTGGACGGGA TCTCGCTGCG GGCGGAACGC
TTGGAAGGCC GCCGCAACAG GGTCAGCCAC ATCATCGCGG CGCCCGTTGC AAAGGGCGCC
GTTCCAGAAC AAACTGACCT TGAAGACCTA CTCGACGAGG CCGAAACAAT GCAACAGGGA
GTTCCACGTG AGCAAGCAGA ATAA
 
Protein sequence
MTPLILVGMA LVFLSFAALL TAAESAFNFL PRHDAEEAVL RSDGQALKRI LDQPVAHMRS 
LRFWRIWFEM ASAVAVAVLL HSLLDSVWLA GLAATGIMAL VGFVIVGVSP RQLGRAHAAG
VARFSAPIIR FLCWVLGPIP GWLVALGSVV APGARAGDEA SFSEEEFREL VDRATESDMI
EDNEAELIQS VFDFGDTLVR AVMVPRTDIL SIDAGSSLHR AMSLFLRSGY SRIPVIRDNT
DQILGIIYLK DVAAALHGLG PGEEPPIVDD LAREVRYVPE SKQVSDLLRE LQKESTHVAI
VIDEYGGTAG LVTLEDLIEE IVGEIVDEYD TESAEAVALG NGSYRVSARM GIDDLGELFD
VELDDDEVDT VGGLLAKALG RVPIVGSTVE VDGISLRAER LEGRRNRVSH IIAAPVAKGA
VPEQTDLEDL LDEAETMQQG VPREQAE