Gene Arth_1764 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1764 
Symbol 
ID4445698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1975347 
End bp1976528 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content65% 
IMG OID639689584 
Productpeptidase M24 
Protein accessionYP_831256 
Protein GI116670323 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.327612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTGTTCA GCGAAGAGGA GTACGCGGCA CGCCTTGCCG GCGCACGGGT TCGGATGGAA 
AAGCAGGGCC TGTCCGCCCT GCTGGTTACC GATCCAGCCA ATATTTACTA CCTGACGGGC
TACAACGCGT GGTCCTTTTA CACCCCGCAG CTGGTCTTCG TACCGGCCAG CGGACCCATG
CTCCTCTTCA CCCGGGCCAT GGACGCGGGG GGTGCCTTCC GCACCACCTG GCTTCCGCAG
GAATGCATCA TCGGGTACCC GGAGCAGTAT GTCCACCGGC CGCACGTGCA CCCCTTCGAC
TGGGTGGCCT TCGCCCTCCG CGAACGCTAC CTGATTGCTC CGGCGGCCAA GGGGTGCGTG
GGCCTGGAAA TGGACTCGCA CTTCTTCTCC CCGAAGGCCT ACCGCTCCCT GGTCAATGCC
ATTCCCGAAT GGGCCCTGGT GGACAGCTTC GAGCTGGTGA ACTGGGTGCG CTCGATCAAG
TCTCCCGCTG AGGTGCAGCT GATGCGCGGC GCAGCCCAAG TCTGCATGGG GGCCATGCAG
GCCGCCGTCG AGACCATCGA CGTCGGGGTC CGCCAGTGCG ACGCCGCCGC GGCCATCAGC
CAGGCCCAGA TCAAGGGGGC GAACGGGATC GGCGGTGACT ATCCCGCCAT CGTGCCGATG
CTGCCCACCG GCGAGGCCGC CGATACCCCC CATCTAACCT GGAGCGAGGA CCGTTTCGAG
GCTGGCCAGG CCGTCGTCAT TGAACTGGCC GGCGCCCACC AGCGCTATCA CACGCCTCTG
GCCCGCACCC TGGTCCTGGG CAAGAAGCCG GACCATCTGG CAAAACTGGC CGACGCCGTC
GCCGACGGCC TGAACGCCGT GCTGACCGAG GTGCAGCCCG GCGTTCCTGT CCGGGAACTG
GCCAGGGCCT GGAACTGGAC GCTGGCCAAA TACGGCCTGG AAAAGCCGTC CCGCATCGGT
TACTCGATCG GAGTAGGCTA CCCGCCCGAC TGGGGCGAGC GAACCATCTC CATCCGCTCC
GAGGACGAAT CCATCCTGGC GGAGAACATG ACCTTCCACG TGATCGGCGG GATGTGGATG
GACAACTACG GTTACGAACT CTCCGAATCG ATCCGCGTCA CCGCCGACGG CGTCGAGACC
TTCACCAGCT TCCCCCGCGA ACTCATCCAG AAAGGCGGCT AG
 
Protein sequence
MLFSEEEYAA RLAGARVRME KQGLSALLVT DPANIYYLTG YNAWSFYTPQ LVFVPASGPM 
LLFTRAMDAG GAFRTTWLPQ ECIIGYPEQY VHRPHVHPFD WVAFALRERY LIAPAAKGCV
GLEMDSHFFS PKAYRSLVNA IPEWALVDSF ELVNWVRSIK SPAEVQLMRG AAQVCMGAMQ
AAVETIDVGV RQCDAAAAIS QAQIKGANGI GGDYPAIVPM LPTGEAADTP HLTWSEDRFE
AGQAVVIELA GAHQRYHTPL ARTLVLGKKP DHLAKLADAV ADGLNAVLTE VQPGVPVREL
ARAWNWTLAK YGLEKPSRIG YSIGVGYPPD WGERTISIRS EDESILAENM TFHVIGGMWM
DNYGYELSES IRVTADGVET FTSFPRELIQ KGG