Gene Arth_3604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3604 
Symbol 
ID4443915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4046654 
End bp4047685 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content60% 
IMG OID639691428 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_833079 
Protein GI116672146 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCCA AGACCCGCGC CCTCACCGGA ATGCTGGCAT GCACCCTCGC CGCCTCAGTC 
CTCGCCGGAT GTGCCTCAGG GGCCTCCGGC GGGACTGGAG CGCCGGTGGC CAAGATCCAG
CTCTCCATCC CCGATCCGCT CACGTCCTCC GTTGGTGTCT CGGCCCAGCA TTTCGCCGAC
CAGGTCAAGA AGACGTCCAA CGGGTCCGTG ACTGTCACCG TAGTCCCCAA CGGAACGAGC
TTCAGTGGAG ACCAGAACGC GGCTGTGACA CGCCTGCAGG GCGGTTCCCT CGATGCACTT
ATCCTGTCGA CGTCCGTCTA CGCAGCCGTC GTACCTGAAA TGAACGCCAT CAGCATTCCG
TTCCTCTTCA AAGACACCAC CGAGGAAGCG GCGTTCCTGG CAGGAAAGCC GGGCCAGGCG
CTGAAGGAAA AGCTTGCCGC CAAAGACACG GTCGCGCTTT CCTTCCTCAC CCGGACCGGC
CGGGAAATTA CGAACTCCGT TCGCCCCATC GAACAGCCCT CGGATCTGAA GGGACTTAAG
ATCCGGGTTC CAGGCAACCC GTTGTGGACC GACTACTTCA GCAAGCTCGG CGCCAGCCCC
ACCACCATGG CATTCTCTGA GGTCTTCACC GGTCTGCAGA CCGGAACGAT CAACGGCCAG
GAAAACCCGA TCGAGGTGCC GTGGACGAAC AAGTTCTCAG AAGTCCAGAA GTACATTTCG
ATGACCAACC ACATCAACGA CGCCTGGGTA CTCGCCCTCT CCTCGAAGAA GTGGGACACC
CTCACCGATG AGCAGAAGAA GGCCCTCACG GACGCCTCTG AGGAAACCGC CACCTTTAAG
ACCGGTTACG ACGCCGAGCA GTCCAAGAAG CAGCTCGAGG AACTCACCGC CAAGGGCATG
AAATCCAACG AGCTCAGCGC CTCCGGTCTG GAGGAATTCA AAGCGGCTTC CAAGAGCCTT
TACCCGACCT TCTCCCAGCT GATCGGCAAG GACTTCTTCG ACCAGGCCAT CGCTTTCACC
ACCACCAAGT AG
 
Protein sequence
MKSKTRALTG MLACTLAASV LAGCASGASG GTGAPVAKIQ LSIPDPLTSS VGVSAQHFAD 
QVKKTSNGSV TVTVVPNGTS FSGDQNAAVT RLQGGSLDAL ILSTSVYAAV VPEMNAISIP
FLFKDTTEEA AFLAGKPGQA LKEKLAAKDT VALSFLTRTG REITNSVRPI EQPSDLKGLK
IRVPGNPLWT DYFSKLGASP TTMAFSEVFT GLQTGTINGQ ENPIEVPWTN KFSEVQKYIS
MTNHINDAWV LALSSKKWDT LTDEQKKALT DASEETATFK TGYDAEQSKK QLEELTAKGM
KSNELSASGL EEFKAASKSL YPTFSQLIGK DFFDQAIAFT TTK