Gene Arth_2788 details

Gene Information

Locus tag: Arth_2788
Symbol:
ID: 4444534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3139834
End bp: 3141426
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 67%
IMG OID: 639690610
Product: Xaa-Pro aminopeptidase
Protein accession: YP_832267
Protein GI: 116671334
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
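
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check; it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon, which the record does not state explicitly. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3139834
end_bp = 3141426

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; assumption: the last codon is a stop and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1593 530
```

Under these assumptions the computed values agree with the listed 1593 bp and 530 aa.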


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.364809
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACGATG CAGACAACAC CTACAACGGT GAAAATACAG CAAGCCAGCC CCTCGAAGAG 
CGCGTCAACA ACCGCTCGCA GCGGCCCAGT TCCGACGCCT TCAAGGCCTT CATGGCCAGC
AACTGGGCGC CCTCGGCACA AGAGCTCCCC GATCGGGACG CCGTGGCCGA TCACGCCGCC
GCCCGGCGTC GCACCATTTC CGGCCTGTTC AAGGGCGAAC GCCTGGTGGT CCCCGCCGGC
CCGCTGAAGG TCCGCTCCAA CGACTGCGAC TACCGCTTCC GCCCCCACTC CGGTTTCGCC
CACCTGACGG GCCTGGGCCT CGACCACGAG CCCGACGCCG TGCTCATTTT CGAACCGGTT
GAAGAAGGAA AGGGCGACGA CGGCGGGAAC CACCGCGCCA CCCTTTACTT CCGGCCCCTC
GCCGGCCGGG ACACCGAACA GTTCTATGCA GACTCCCGCT CCGGCGAATT CTGGATCGGT
GCCCGCCCCA CGCTGGCAGA ATTCGAACGC AGGCTGGGCC TCGCCACTGC CCACATCGAC
GAGCTCGAAC TGGCAATCAC CAAGAATGTG GGCGCCCCCG AAATCGGCGG TATCTCCATC
CGGCTGGTGC GCAAGGTGGA CGAGAACATC GACGCCCTGG TGGACACGGC CCGCTACAAC
ACTGCCAAGG ACCCGGACAA CCTGGACCTG GGCGTGCTGG ATGCCCTTGA TGAGAAGCTC
ACCGAGGCCC TCTCCGAGCT CCGCCTGGTC AAGGATGCGT GGGAAATCGA GCAGATGAAG
ACCGCCGTGG CCGCGACCGT GGAAGGGTTC ACCGAGGTCG TCAAGGCCCT CCCCCGGGCC
CTGACCCACC GGCGCGGCGA GCGCGTCGTC GAGGGAGCCT TCTTTGCCCG TGCCCGGGAA
GAGGGCAATG AGCTGGGCTA CGACACCATC GCGGCCTCGG GCAACAACGC CACCGTGCTG
CACTGGACGC GGAACACCGG AACGGTCAAC GCCGGCGAGC TCCTGCTGCT GGATGCCGGC
GTTGAGGCCG ATTCCCTCTA TACGGCTGAC ATCACCCGTA CCCTGCCCGC CAACGGCACG
TTTACCGAGG TCCAGCGCAA GGTCTACGAG GCTGTCCTGG ACGCAGCGGA CGCCGGCTTC
GCCGCCGCGC AGCCCGGCAC CAAGTTCCGC GACATCCACA CGGCCGCCAC CACTGTCCTC
GCTGAGCGCC TGGCGGAATG GGGCCTGCTG CCCGTGTCCG TCGAGGAAGC CATCAGCCCC
GAGGGCCAGC AGCACCGCCG CTGGATGCCG CACGGCACCA GCCACCACCT TGGCCTCGAT
GTGCACGACT GCGCCCAGGC CAAGCGTGAG CTCTACCTGG ACGGCGTCCT GACCCCGGGA
ATGGTGTTCA CGATCGAGCC GGGCCTGTAC TTCAAGAACG AGGATCTCGC GATTCCGGCG
GAATACCGCG GCATTGGCGT CCGGATCGAG GACGACATCC TCATGACTGC CGACGGTCCG
GTCAACCTCA GCGCCGCACT CCCCCGCAAG GCCGACGACG TCGAGTCCTG GATGGCGGGC
ATCTACCAGG AAGCAGAGCA CGCACAGCCG TAA
 
Protein sequence
MNDADNTYNG ENTASQPLEE RVNNRSQRPS SDAFKAFMAS NWAPSAQELP DRDAVADHAA 
ARRRTISGLF KGERLVVPAG PLKVRSNDCD YRFRPHSGFA HLTGLGLDHE PDAVLIFEPV
EEGKGDDGGN HRATLYFRPL AGRDTEQFYA DSRSGEFWIG ARPTLAEFER RLGLATAHID
ELELAITKNV GAPEIGGISI RLVRKVDENI DALVDTARYN TAKDPDNLDL GVLDALDEKL
TEALSELRLV KDAWEIEQMK TAVAATVEGF TEVVKALPRA LTHRRGERVV EGAFFARARE
EGNELGYDTI AASGNNATVL HWTRNTGTVN AGELLLLDAG VEADSLYTAD ITRTLPANGT
FTEVQRKVYE AVLDAADAGF AAAQPGTKFR DIHTAATTVL AERLAEWGLL PVSVEEAISP
EGQQHRRWMP HGTSHHLGLD VHDCAQAKRE LYLDGVLTPG MVFTIEPGLY FKNEDLAIPA
EYRGIGVRIE DDILMTADGP VNLSAALPRK ADDVESWMAG IYQEAEHAQP
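
The protein sequence above corresponds to the gene sequence read with translation table 11, in which an alternative start codon such as GTG is read as methionine when it initiates translation. The sketch below illustrates this in plain Python. The helper names (`clean`, `translate_cds`, `gc_content`) are hypothetical and not part of the source database, and the code assumes the input is a complete coding sequence that begins at its start codon.

```python
# Codon-to-amino-acid assignments in the usual T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS up to the first stop codon.

    Assumption: the first codon is the start codon, so it is emitted as M
    regardless of its usual meaning (for example, GTG normally encodes V).
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as displayed in the record.
first_codons = "GTGAACGATG CAGACAACAC CTACAACGGT"
print(translate_cds(first_codons))  # MNDADNTYNG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` would give a figure to compare against the listed GC content.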