Gene Arth_2643 details

Gene Information

Locus tag: Arth_2643
Symbol:
ID: 4444764
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 2963563
End bp: 2964900
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 65%
IMG OID: 639690463
Product: hypothetical protein
Protein accession: YP_832122
Protein GI: 116671189
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
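
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2963563
end_bp = 2964900

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# Split into codons; a clean CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: exactly one terminal stop codon, which encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1338 0 445
```

Under these assumptions the result agrees with the listed Gene Length (1338 bp) and Protein Length (445 aa).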


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATGGC TACTGCTAGG AGCAGGCATC CTGCTCATCC TCGGTACGGG TTTCTTTGTC 
GCCGTGGAGT TCTCACTCGT CGCCCTTGAC CAGGCCACTG TCCAGCGGGC CGTGGACAGC
GGCGACGCCG CGGCCGTGCC GCTGCTCAAG TGCCTTAAAT CCCTGTCCAC CCAGCTGTCC
AGCTGCCAGC TCGGGATCAC GCTGACCACC CTGCTGACCG GCTATGTCAT GGAACCGTCC
GTAGGAAAGC TGCTGGAAGG GCCGCTTACC GCGGCCGGCG TTCCGGAAGC GGCAGCGGCT
TCGGTCTCCC TGGTGATGGC CATGACCATC GCCACCCTCC TGTCCATGCT GATCGGCGAG
CTCGTGCCCA AAAACATGGC CATTGCACTG TCCTTGCCGA TCGGCAAGGC TGTTGCGCGG
CCGCAGCTGG TGTTCACCGC CGTCTTCAAG CCCGCGATCA TCGTATTGAA CGGCTTCTCC
AACAAGGTGC TGAATGTCTT CGGACTTGAA GCCAAGGAGG AGATCTCCGG TGCGCGGACA
CCGGCCGAAC TGGCATCCCT GGTCCGCCGT TCCGCCGCCA TGGGAACTCT TGACCCGGGC
ACCGCAAACT TCATTGCCCG GACGCTGAAG TTCTCCGGCC GGACCGCCGC CGACGTGATG
ACGCCCAGGA TCCGGCTCGA AACCATCGGC GCCGCGCAGC CGGTGTCGGA CATCATCGAT
GCCGCCCGGC GGACAGGCTA TTCACGCTTC CCGGTCATCG GCGAGTCCGC CGACGACATC
CGGGGCCTGG TGCACGTGAA GAAGGCAATC GCTGTCCCCT CGGAACGGCG CGCCAACCTC
GAGGCCGGTG CCATCATGAC GGACGTCCTC AGGGTTCCCG AGACAATCCA CCTTGATGCC
CTGCTGGCTG AACTGCGCGA AGGAAACATG CAGCTGGCCG TTGTGCTGGA TGAATACGGC
GGAACGGCCG GCATCGCCAC GCTTGAGGAC CTCGTGGAGG AAATCGTCGG GGAAGTCGCG
GATGAACATG ACAAGGTGCG GCCGGGACTG CTCCAGAGCG CATCGGGGGA CTGGTACTTC
CCCGGCCTGC TGCGCCCGGA CGAGCTGTCC GAACAGATCC CCGGATTGAC GGTGCCTGAT
GAGTCCGCCT ATGAAACCGT GGGCGGGTAT GTCATGAGCC AGCTCGGCCG GATCGCCGCA
GTCGGGGACA CCGTGGATGT CGGCGGGGGC ACCCTGAGCG TGACCCGGAT GGACGGACGC
CGGATCGACC GGATCTGCTT CAAGCCTGCC CCGGTCCTCG GCGAAGAACA CGCTGCCGGC
CAGGGAGGGA CGGCATGA
 
Protein sequence
MEWLLLGAGI LLILGTGFFV AVEFSLVALD QATVQRAVDS GDAAAVPLLK CLKSLSTQLS 
SCQLGITLTT LLTGYVMEPS VGKLLEGPLT AAGVPEAAAA SVSLVMAMTI ATLLSMLIGE
LVPKNMAIAL SLPIGKAVAR PQLVFTAVFK PAIIVLNGFS NKVLNVFGLE AKEEISGART
PAELASLVRR SAAMGTLDPG TANFIARTLK FSGRTAADVM TPRIRLETIG AAQPVSDIID
AARRTGYSRF PVIGESADDI RGLVHVKKAI AVPSERRANL EAGAIMTDVL RVPETIHLDA
LLAELREGNM QLAVVLDEYG GTAGIATLED LVEEIVGEVA DEHDKVRPGL LQSASGDWYF
PGLLRPDELS EQIPGLTVPD ESAYETVGGY VMSQLGRIAA VGDTVDVGGG TLSVTRMDGR
RIDRICFKPA PVLGEEHAAG QGGTA
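
The relationship between the two sequences above can be illustrated by translating the gene sequence codon by codon. The following is a rough sketch rather than the method the database used: it assumes the gene sequence is given 5' to 3' on the coding strand, and it uses the standard amino acid assignments, which translation table 11 shares (table 11 differs mainly in which codons may act as start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the record's 10-base layout.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence, copied from the record.
# The last block starts at base 1321, so it is still in frame.
first_block = "ATGGAATGGC TACTGCTAGG AGCAGGCATC"
last_block = "CAGGGAGGGA CGGCATGA"

print(translate(first_block))  # MEWLLLGAGI
print(translate(last_block))   # QGGTA
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would be the natural check on the whole 445-residue product.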