Gene Arth_3332 details


Gene Information

Locus tag: Arth_3332
Symbol:
ID: 4444061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3743898
End bp: 3745013
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 70%
IMG OID: 639691155
Product: hypothetical protein
Protein accession: YP_832807
Protein GI: 116671874
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
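
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function and variable names are illustrative only.

```python
# Coordinates copied from the "Start bp" and "End bp" fields of this record.
START_BP = 3743898
END_BP = 3745013


def gene_length_bp(start, end):
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # prints: 1116 371
```

Both results agree with the Gene Length (1116 bp) and Protein Length (371 aa) fields.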


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCG CTACCCAACA GGCACAGCAG GCGACCACCG GCACCACCAC GGGACCGTGG 
AACCCCGCAG ACCGGGCCGT CAAGCGCCGG CGGGTCCTGG ACATCCTGGA TGCCGCGGGC
AGGGACTCCC TGCTCCTGAC CACCAACACG GCGCTGACCT GGTACCTGGA CGGGAGCCGC
GTCCACATCA GCCTCGCCGG CGACCCCGTC GCCGCCATGC TGGTCGACCG TGACGGTGAC
CATCTGGTCA CCTACAACAA CGAAGCCGCC CGGATTGCGG CAGAGGAACT GCCCGACGGC
GTGAACCTCC ACACGGTGCC CTGGCACGGG CAGCTGCACG CCGCCGCAGC CATGCTCGCA
CCTGACGGAA GGCCCCTTGC GGAGACTGAT GTGGCCGCCG AGCTGAGGAC CGCCCGCCAG
CCGTTCCTGC CCGGCGAGAG CGCCCGGTAC GCCCGGCTGT GCGCCGATGC CGCTGCAGCG
ATGACAGCCG TCCTTTCCGG CACCACCCCG GAAACCACCG AGTTCGCTGT GGCCTCCGCC
CTGGCTGCAC GGATCGTGGC GATGGGGGCC GAGCCGCTGG TGCTTCTGTG CAGCGGCGCC
GGACGCAGCG GGTTCCGGCA CCCGCTGCCT ACCCACGCGC CGATCGGCCG GCGGGCCATG
GCGGTGGTGT GCGCGCGGCG CAACGGACTG GTGGCCAATG TGACCCGCTG GGTGCGGTTT
GACGCCGGAA CCCCGGGCGA ACTCGACGCC GAAGCCCGGA TTGCCGCAGT AGAAGCGGAC
ATTTTCGACG CCACTGTGCC CGGTGCACGG TTGGACGGCA TCTTCGCTGA AATCCAGGAA
GCCTACCTCC GCCACGGCTT CGGCGCAGAC CAATGGACCC TCCACCATCA GGGCGGCCCG
GCCGGTTACG CGGGCCGCGA TCCCCGGGCG ACGCCCGGCA CCGACGATGC CGTGGTCCTC
AATCAGACCT TCACCTGGAA TCCTTCCGGT CCCGGAGTGA AGATCGAAGA CACGGTCCAG
CTGACGGAGA CGGGGATCAC CGTCCTCAGC GTGGACCCGA ACTGGCCGGC CGCCGTCGTT
AACGGCATCC GGCGGCCGCT GACCCTGGAG CTGTGA
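
The sequence is printed in space-separated blocks of 10 bases, so it must be cleaned before any calculation. The sketch below shows one way to strip the whitespace and compute a GC percentage. How this page derives or rounds its own "GC content" value is not stated, so this is an illustration under that assumption and not the page's method; the helper names are hypothetical.

```python
def clean_sequence(text):
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGAACACCG CTACCCAACA GGCACAGCAG GCGACCACCG GCACCACCAC GGGACCGTGG"
seq = clean_sequence(excerpt)
print(len(seq), round(gc_percent(seq), 1))
```

To check the full gene, pass all lines of the gene sequence to `clean_sequence` and compare the result with the GC content field.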
 
Protein sequence
MNTATQQAQQ ATTGTTTGPW NPADRAVKRR RVLDILDAAG RDSLLLTTNT ALTWYLDGSR 
VHISLAGDPV AAMLVDRDGD HLVTYNNEAA RIAAEELPDG VNLHTVPWHG QLHAAAAMLA
PDGRPLAETD VAAELRTARQ PFLPGESARY ARLCADAAAA MTAVLSGTTP ETTEFAVASA
LAARIVAMGA EPLVLLCSGA GRSGFRHPLP THAPIGRRAM AVVCARRNGL VANVTRWVRF
DAGTPGELDA EARIAAVEAD IFDATVPGAR LDGIFAEIQE AYLRHGFGAD QWTLHHQGGP
AGYAGRDPRA TPGTDDAVVL NQTFTWNPSG PGVKIEDTVQ LTETGITVLS VDPNWPAAVV
NGIRRPLTLE L