Gene Arth_0391 details


Gene Information

Locus tag: Arth_0391
Symbol:
ID: 4447146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 416996
End bp: 417991
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 69%
IMG OID: 639688187
Product: hypothetical protein
Protein accession: YP_829892
Protein GI: 116668959
COG category: [E] Amino acid transport and metabolism
    [G] Carbohydrate transport and metabolism
    [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
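
The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so treat both as assumptions.

```python
# Values copied from the record above. The 1-based, inclusive coordinate
# convention is an assumption, not something the record states.
start_bp = 416996
end_bp = 417991

# Inclusive span of the gene on the replicon, in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, minus one for the stop codon (assumed to be
# counted in the gene length).
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 996 331
```

Under these assumptions the result agrees with the listed 996 bp and 331 aa.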


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAACCA CCGCCGAATC CGGCCCCACG CCGTCCGCCG GCCCCCACGA AGGGTCTCCG 
CACGCCGCAC CAGCACCGCA GCCCCCGTCA CACGGCAACA CCTCCGTGCC GCTTCCCTCC
CCAGGCAAGA CGCCGGTCAA CAGGCTGGGC GTCGCCGCCG TCCTCGTCAC CGTGGTCCTC
TGGGCCTCCG CGTTCGTGGG CATCCGCGCC GTGGGCCCAA GCTTTTCACC CGGTTCCCTC
ACCCTGGGAC GGCTGGCGAT TGCCGCCGTC GTCCTCGGCC TGGTGGTGCT GCCCAAGCTG
CGGATACTGC CCAAGGGCCG GGAGTGGTGG CCCATCCTCG CCTACGGCGT GATGTGGTTC
GGCGGCTACA ACGTAGCCCT CAACGCCGCC GAGCACCTGC TGGACGCCGG CACTGCCGCG
CTGCTGATCA ACGTCAACCC CATCCTGGTG GCGGTGATGG CCGGCCTGCT GCTCAAGGAG
GGTTTCCCGC GCTGGCTGAT CATCGGGAGC CTGATAGCGT TCGCCGGCGT CGCGGTAATA
GCGCTCAGTT CCGGACAGCG TTCGACGGCG GATGTGGCCG GCGTGCTGCT CTGCCTCCTG
GCCGCTGTGC TCGCCGCCGT CAGCGTGATC ATCCAGAAGC CGGTGCTGAG GAAGTTCCCC
GCGGCCCAGG CCACCTGGTT CGGCATCATG GTGGGTGCAG TCTGTTGCCT GCCGTTCAGC
GGCCAGCTGG TGGCCGAGCT GCAGCAGGCT CCGCCCCCGG CCACCCTGGG GCTGGTCTAT
CTGGGAATCT TCCCGACGGC GATTGCCTTC ACCACGTGGG CGTACGCGCT GTCCCTCATC
GACGCCGGGA AGCTGGCCGC CACCACGTAT CTGGTGCCCG GCACCACCAT CCTCATTTCC
TGGCTGGTGC TCGGCGAAGT GCCCACTGTG TGGGGCCTTG TGGGCGGCGT GATCTGCCTG
GTGGGAGTGG GGCTGACGCG GCGCAAATCC CGCTAG
 
Protein sequence
MATTAESGPT PSAGPHEGSP HAAPAPQPPS HGNTSVPLPS PGKTPVNRLG VAAVLVTVVL 
WASAFVGIRA VGPSFSPGSL TLGRLAIAAV VLGLVVLPKL RILPKGREWW PILAYGVMWF
GGYNVALNAA EHLLDAGTAA LLINVNPILV AVMAGLLLKE GFPRWLIIGS LIAFAGVAVI
ALSSGQRSTA DVAGVLLCLL AAVLAAVSVI IQKPVLRKFP AAQATWFGIM VGAVCCLPFS
GQLVAELQQA PPPATLGLVY LGIFPTAIAF TTWAYALSLI DAGKLAATTY LVPGTTILIS
WLVLGEVPTV WGLVGGVICL VGVGLTRRKS R
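
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs mainly in which start codons it allows, which this sketch ignores).

```python
from itertools import product

# Codon table built in the conventional TCAG order. These amino-acid
# assignments are the standard ones, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
first_block = "ATGGCAACCA CCGCCGAATC CGGCCCCACG"
print(translate(first_block))  # MATTAESGPT
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content field listed in the record.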