Gene Arth_1663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1663 
Symbol 
ID4445820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1858005 
End bp1859336 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content70% 
IMG OID639689478 
Productsecreted protein 
Protein accessionYP_831157 
Protein GI116670224 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCCA TCCCGGGGCT GCGCCCGGCC ACCCCAGCGA ATCCGCCCCC TGCCCTGGAC 
AGGGTCCGAA CCGCCGCCGG CGACGGCATG TCTTCCAGCG CGAAGTGGGC CGTGGGCGGA
GTCATCGCGG GAGGTGCGGC TGCGGGTCTG CTGGGCGCCG GATCCTCGGC CCTTGCCCTC
TATTTCGCCC GCCGCGTCAT AACGCCGGTG CGGGTCCGGG ACGAGAACCA GGAGGTGCTG
GCCGTCATCC GTGCCGGGGA CGGCCTCCAG GTCATCCTTG CCGCCACCGA CGACGCCACC
GTGGAAGGCG TGTACGGCTT CTTCTTCGAC GGCGGCCGCG GGCACGCCCG GATCGGCAGG
ATCGTTTCCT ACTCACCTGC GGAACGCACC GTGCTGCGCG AGGTCGAGGC CGTCTATGCC
GGCGATCTCA CCACGGCCCG CCGTGGCTAC TGGAGCGGCG CCGCCTATCC GGATGCCGCG
TCCATCGGGC TGTCGGCCGA GGACGTGGAA GTCGACGTCG AGGGAGGGAC TGCGCCCGCC
TGGCTGGTGC GCGCGGCGGC ACCGTCGGAC GTCTGGGCCA TCATGGTCCA CGGCCGCGGC
GCCAGCCGGC AGGAGTGCCT GCGCGCCCTG CGCCCGGCCC GGGAGCTCGG CCTGACCAGC
CTGGTGATCT CCTACCGCAA CGACGGGCTG GCGCCCTCGG CCCCCGACGG GCGGTACGGC
CTCGGTTCCA CCGAATGGCG CGACGTCGAG GCAGCCATCA GTTTTGCCAT AGCCAACGGC
GCCAGGGAAG TGGTGCTGTT CGGCTGGTCC ATGGGCGGGG CGATCTGCCT GCAGACGGCG
GACCTTTCAC GGCACCACAA CCTGATCCGG GCCATGGTGC TTGACGCTCC GGTGGTGGAC
TGGGTGAACG TGCTGGCGCA CCATGCGCAG CTGAACCGGA TCCCCTCTGC CGTGGGACGC
TACGGACAGC TGATGATGGG CCACCCGCTG GGCAGGCGCC TGACCGGGCT GGCGGCACCG
GTGGACCTGA AATCGATGGA CTGGGTCTCC CGCGCCGTGG AACTCAGAAC GCCCACGCTG
ATCCTGCACA GCGTCGACGA CGAATATGTA CCCTACGGAC CCTCTGCCAG CATCGCAGAG
AAGAACCCCG AGATGGTCAC CTTCGAGACG TTCCAGACAG CCCGGCACAC CAAGGAATGG
AATGTCGATC CGGAGCGCTG GGAAGGCTTG GTCACCGCCT GGCTCCGCCG GCAGCTGGCG
CCGCGCGCCA ATCCCGGCGC CCGCACCCGG GGCCCCGACC CCGCCCGGCC CGACGTTAAC
GGTCCGGAGT AG
 
Protein sequence
MASIPGLRPA TPANPPPALD RVRTAAGDGM SSSAKWAVGG VIAGGAAAGL LGAGSSALAL 
YFARRVITPV RVRDENQEVL AVIRAGDGLQ VILAATDDAT VEGVYGFFFD GGRGHARIGR
IVSYSPAERT VLREVEAVYA GDLTTARRGY WSGAAYPDAA SIGLSAEDVE VDVEGGTAPA
WLVRAAAPSD VWAIMVHGRG ASRQECLRAL RPARELGLTS LVISYRNDGL APSAPDGRYG
LGSTEWRDVE AAISFAIANG AREVVLFGWS MGGAICLQTA DLSRHHNLIR AMVLDAPVVD
WVNVLAHHAQ LNRIPSAVGR YGQLMMGHPL GRRLTGLAAP VDLKSMDWVS RAVELRTPTL
ILHSVDDEYV PYGPSASIAE KNPEMVTFET FQTARHTKEW NVDPERWEGL VTAWLRRQLA
PRANPGARTR GPDPARPDVN GPE