Gene Arth_2819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2819 
Symbol 
ID4444615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3168657 
End bp3170372 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content64% 
IMG OID639690641 
ProductABC transporter related 
Protein accessionYP_832298 
Protein GI116671365 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.810211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCT CTGTCGAAAT CCACGAAGCC GGCATCATTG CCGAACGCCC GCTCCTTGAA 
ATCAAGGATC TGGCGATCAA TTTCAGCACC AGCAACGGCG AGGTCAACGC CGTCCGGAAC
GCCCACCTCA CGGTGATGCC CGGTGAGACG GTTGCCATCG TCGGCGAGTC TGGTTCGGGC
AAGTCGACGA CGGCGCTTGC CGCCATCGGC CTGCTTCCAG GAAACGGCCG GGTGGCGGCC
GGGCAGGTCT TCTTTGACGG TGAGGACATT GCCCACGCCT CCGAGAAGCG CATGATCGAA
TTGCGCGGCA ACAGCATCGG AATGGTGCCT CAGGACCCGA TGTCCAACCT CAACCCTGTC
TGGAAGATCG GTTACCAGGT CCGCGAAACA CTGAAGGCGA ACGGGCTGCC CAGCGGACCC
GACGACGTCG CGAAGGTGCT TTCCGAAGCA GGGCTTCCCG ATGCGGCGCG GCGCGCGAAG
CAGTACCCGC ACGAGTTTTC CGGTGGCATG CGTCAGCGTG CCCTGATCGC GATTGGCCTC
TCCTGCCAGC CGCGCCTGCT GATCGCCGAT GAGCCGACGT CGGCCCTGGA CGTGACCGTG
CAGCGGCAGA TCCTGGACCA CCTGGACAAG ATGACCACCG AACTCGGGAC GGCCGTACTG
CTGATTACGC ACGATCTCGG CCTCGCCGCC GAACGGGCGG ACAAGGTGGT GGTGATGTAC
CGGGGCCAGG TGGTGGAGTC GGGTCCGTCG CTGGAGCTGC TGCAGAACCC GCAGCACCCG
TACACGCAGC GCCTGGTTGC CTCGGCGCCG TCTCTCGCTT CACGCCGCAT CCAGGTGGCC
AAGGAGCTCG GAGTCGAATC CGATGAGCTG CTTGCCCCCA CCGACGCCGT GGTTGTCGAG
CCCACCCAGC CTGAGGCTGT GCTCCAGATC CAGAACCTCC GCAAGGTGTT CAAGCTCCGT
TCCGGCTTCG GTAAGTCCAC CGACTTCACC GCCGTGGACG ACGTCTCGTT CAGCGTCAAA
CGCGGGACGA CGACGGCGAT TGTGGGGGAG TCGGGTTCCG GCAAGTCCAC TGTGGCGCAG
ATGGTGCTTA ACCTCCTGCA GCCGACGTCG GGCAAGATCG TGTTCGACGG CGTGGATACG
TCCACGCTCA ACAACAAGGA GATCTTCGCG TTCCGCCGCC GCGTGCAGCC GATCTTCCAG
GATCCGTACG GTTCCCTCGA CCCGATGTAC AACATCTTCC GGACTATCGA GGAACCGCTG
CGCACCCACA AGATCGGGGA CAAGGCAAGC CGCGAGAAGA AAGTCCGCGA GCTTCTGGAC
CAGGTGGCAC TGCCGCAGTC CACCATGCAG CGGTACCCGA ATGAATTGTC CGGCGGTCAG
CGACAGCGTG TTGCCATTGC CCGTGCACTG GCACTCGATC CGGAAGTGAT CATCTGCGAC
GAGGCGGTCT CCGCCCTCGA CGTGCTGGTC CAGGCGCAGG TGCTGAACCT GCTCGCTGAG
CTCCAGTCCA GGCTGGGCCT GACCTACCTC TTTATCACGC ACGACCTCGC CGTGGTCCGG
CAGATTGCCG ACCACGTCTG CGTGATGGAA AAGGGCAAGC TGGTTGAAAC CAGTTCAACG
GATGACGTCT TCGACGCGCC GCAGATGGAC TACACGAAGG CACTGCTCAA CGCCATTCCG
GGGGCGCGGC TCATGCTGCC CCCCGAGGTT GCCTAA
 
Protein sequence
MNASVEIHEA GIIAERPLLE IKDLAINFST SNGEVNAVRN AHLTVMPGET VAIVGESGSG 
KSTTALAAIG LLPGNGRVAA GQVFFDGEDI AHASEKRMIE LRGNSIGMVP QDPMSNLNPV
WKIGYQVRET LKANGLPSGP DDVAKVLSEA GLPDAARRAK QYPHEFSGGM RQRALIAIGL
SCQPRLLIAD EPTSALDVTV QRQILDHLDK MTTELGTAVL LITHDLGLAA ERADKVVVMY
RGQVVESGPS LELLQNPQHP YTQRLVASAP SLASRRIQVA KELGVESDEL LAPTDAVVVE
PTQPEAVLQI QNLRKVFKLR SGFGKSTDFT AVDDVSFSVK RGTTTAIVGE SGSGKSTVAQ
MVLNLLQPTS GKIVFDGVDT STLNNKEIFA FRRRVQPIFQ DPYGSLDPMY NIFRTIEEPL
RTHKIGDKAS REKKVRELLD QVALPQSTMQ RYPNELSGGQ RQRVAIARAL ALDPEVIICD
EAVSALDVLV QAQVLNLLAE LQSRLGLTYL FITHDLAVVR QIADHVCVME KGKLVETSST
DDVFDAPQMD YTKALLNAIP GARLMLPPEV A