Gene Arth_4014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4014 
Symbol 
ID4447815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4530973 
End bp4532682 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content62% 
IMG OID639691845 
Productextracellular solute-binding protein 
Protein accessionYP_833489 
Protein GI116672556 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATC TGACCAAGAT TGGCGGGGCG GCCGCCCTAA CGGCAGCGCT TGCGCTCACA 
GGCTGCGGCG GAGGCGGGAC CTCCTCCGGC CCCGAGGCAG GCAAAGGCCA GGACTCCGGC
AGCGACCTCA CCAAGCTGAT CAGCATCAAT GCCAAGGACG CCAAGGACCT GGAACCGGGC
GGCACCGTGA CCCTTCCGCT GGGCAACATC GGTCCGGACT TCAATGGCTT TTCCAACAAC
GGCAACAGCG CGGACAACAC CGCGCTGCAC CGCCCCATCG ACGGTGCGGG AACGTGGGGC
TGCTGGAACT TCGACTTCGA CGGCACAGCC ACGCCGAACA AGGACTTCTG CGAAGACGTC
AAGAGCGAGG TCAAGGACGG CAAGCAGACC ATCACCATCA AGGTGAACGA GAAAGCCACC
TACAACGACG GCACGCCGAT CGACGTCAAG GCCTTCCAGA ACACCTGGAA CATGCTCAAG
GGTGAAAACA AGGACATCGA CATCGTCAGC TCCGGCGCGT ACGAGTTCGT TGATTCCGTG
AAGGCCGGTT CCAGTGACAA GGAAGTTGTG GTCACCACCA CCCAGCCGGT ATTCCCGCTG
GATGCCCTCT TCACTGGCCT GATCCACCCG GCAGTGAATA CGCCGGAACT CTTCAACACC
GGCTTTACCG GCGACATGCA CCCGGAGTGG ATGGCCGGCC CGTTCAAGCT GGACCAGTAC
GACAGCGCCG CCAAGACCGT GACCCTGGCC CAGAACGACA AGTGGTGGGG CACCAAGCCG
GTCCTGGACA AGGTGGTGTT CCGCCAGCTG GAGACCAGCG CCCAGATTGC AGCGTTCAAG
AACGGTGAAA TCGACGGCGT CTCGGCCAAC ACCATCGCGC TGTACAAGCA GCTCGACGGC
ACCAAGAATT CAGAGGTCCG CCGCGGCCAG CGCCTGTTCG CAGGTGGCCT GAACCTGAAC
GCCCAGAAGG CTCCGATGAC CGACGTCGCC ATCCGCAAGG CGATCTTCAC CGCCGTGGAC
CGCGAAGCAC TCCGGAAGGT CCGCTTCAAC GGCCTGAACT GGGAAGAGAC CAGCTCCGGC
TCAATGATGC TGCTGCCGTT CTCCAAGTAC TACCAGGACA ACTATCCGGC CACGGAATCC
GGTGCCGAAG CAGCCAAGAA GGTACTGACC GATGCCGGCT ATAAGCCCAA CGCAGCGGGC
ATCATGGAGA AGGACGGAGT CCCGGCCGCC TTCAAGATCA GCAACTTCGG TGACGACCCC
ACCACCCTGG CGTTCGTGCA GACCCTGCAG AAGCAGCTCC AGGCCGGCGG CATGGACGTA
GGGATCGACC AGCGCGCCTC CGCCGACTTC GGCAAGGTAC TGGGAAGCCG CGACTTCTTC
CTGAGCGTTT CCGGCTACAC CGTCGGCGCT GATGCGACCG ACGCCGTCAA GCAGTTCTAC
GACTCCAAGA CCAACGAGAA CGGGTTGGGC GACGCGGAGC TGGACGCCGA GATCAAGGCC
CTCAGCAGCA TCGAGGACAA CGCCGAGCGC AACAAGGCGG CCATGGAGGT TGAAAGGAAG
CACATGGCCA AGTACTTCTC CATGGGTGTT GTGATGAACG GCCCGCAGAT CTCGTTCGTC
CGCACGGGCC TGGCAAACTA CGGCCCGTCC CTGTTCAAGA GCCTGTCCCA GGTTCCGGAC
TGGACCAGCC TCGGCTGGGA AAAGAAGTAA
 
Protein sequence
MKNLTKIGGA AALTAALALT GCGGGGTSSG PEAGKGQDSG SDLTKLISIN AKDAKDLEPG 
GTVTLPLGNI GPDFNGFSNN GNSADNTALH RPIDGAGTWG CWNFDFDGTA TPNKDFCEDV
KSEVKDGKQT ITIKVNEKAT YNDGTPIDVK AFQNTWNMLK GENKDIDIVS SGAYEFVDSV
KAGSSDKEVV VTTTQPVFPL DALFTGLIHP AVNTPELFNT GFTGDMHPEW MAGPFKLDQY
DSAAKTVTLA QNDKWWGTKP VLDKVVFRQL ETSAQIAAFK NGEIDGVSAN TIALYKQLDG
TKNSEVRRGQ RLFAGGLNLN AQKAPMTDVA IRKAIFTAVD REALRKVRFN GLNWEETSSG
SMMLLPFSKY YQDNYPATES GAEAAKKVLT DAGYKPNAAG IMEKDGVPAA FKISNFGDDP
TTLAFVQTLQ KQLQAGGMDV GIDQRASADF GKVLGSRDFF LSVSGYTVGA DATDAVKQFY
DSKTNENGLG DAELDAEIKA LSSIEDNAER NKAAMEVERK HMAKYFSMGV VMNGPQISFV
RTGLANYGPS LFKSLSQVPD WTSLGWEKK