Gene Arth_3603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3603 
Symbol 
ID4443914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4044650 
End bp4046590 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content63% 
IMG OID639691427 
ProductTRAP dicarboxylate transporter, DctM subunit 
Protein accessionYP_833078 
Protein GI116672145 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component
[COG3090] TRAP-type C4-dicarboxylate transport system, small permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGACAG TCTGGAAGCC CATGAAAAAC ATCACTACGC CCGAGGAACT CGAGGAGGTC 
CTCCCCTCGG ACGCGGAAGA GATACTCCAC CACGGCCATG TACCCCCGCG TTGGAGCGGT
GCCCTGTGGC TAGACAAAAC GCTTGAATGG GTTGTGGGCG CTGCCATCCT CGCGGAGCTC
GTCGTGATCC TGCTGAACAT CATGGTGAGG GTGGTCACCG GCGATTCCGT GCTCTGGACC
CAGGAAGTCT CCGAGATCGC GCTGCTGACC ATCGCTTTCA TCGGCGGCGC CATCGCGTAC
CCCAAGGGCG CGCACATGTC CGTCCAGGCC CTCATCATGC GCCTCCCCGC CACTTGGAAG
CCTTACCTGG CCGCCCTGGT CGACTGGCTG GTGTTCATCA TGAGCGCAGG CGCATTTGCG
CTGTTCGTTC CCACCCTTGT CCAGCAGATC GAGGAAAAGA CCCCGATCCT GCAGCTGCCC
GTTTTCTGGG TCTCACTGCC CTTCTCCGTC GGCATGGTGC TGATCGCCTG GTTCGCCCTC
CTCAAACTGT GGCGCCAGGA CCGCCGGCCG GCACTCATTG CTGCGGGAAT CGCCGCCGGA
CTGATTGTTC TCGTCCTTGT GGCCCAGCCG CTCTTCTACT ACGCCACCCC CAACGTCCTG
CTCGGCGTCG TCCTCCTGCT TTTGTTCATG CTGCTGTTCC TGGGCCTGCC CATCGCATTC
GTCCTTGCCC TGGCATCCGG GATCTACCTT TACCTGGGTG GCATTTCCGA GGTCAGCGCC
ATCCCCATCG GAATGGCTTC CGGGGCTAAG GGCTTCGTCC TTCTGGCGAT CCCGTTCTTT
ATCCTTGCCG GCACGGTAAT GAACTCCGCC GGCCTGACCC TTCCGCTGGC CAAGCTAGTC
GATGCCCTGA TCGGGCACCT GCGCGGTGGC CTGCTCCAGG TGGTCGTCGT GACCATGTAC
ATCTTCTCCG GCATCTCCGG CTCCAAGGTG GCCGACGTCG CAGCAGTGGG CACCACGATG
CGCGGCATGC TCGAAGAACG AAAGTACCCC CGTGGAGAAG TCGTCGCCGT CCTGTCGGCC
TCGGCCATTA TGGGCGAAAC CATCCCGCCG AGCATCGTGC TCCTGATCCT TGGATCCATC
ACCACCATCT CCACCACCAC GCTGTTCCTG GCCGGCTTCG TTCCCGCCGC CTTCCTGGCC
CTCTGTGTCA TGGCGCTCGT CTTCTTCCGC GCCCGCAAAC AGGGCGGCGT CGCCAGCCCC
AAATCCAGCT GGCGTGAACG CGGATCGGCA ACCTTCTTCG CAATTCCGAC TCTTCTGCTT
CCTGTGGGCA TGGTGGTGGG AATCCTCAGC GGCTTTGCCA CACCCACTGA AGTGTCGTCC
GTGGCCGTCG CCTACGCCTT CATCCTCGCC GCCGCTTACC GGCGCGGCAG CAAGCGGCTG
CTGGGTGACA CGCTGCGGGA AACCACGACG ACGGCGGGAA TGGTGTTGTT CATCATCGCG
GCGGCGTCGC CGCTGGCCCA GACCCTCGCG CTTGCAGGTG TCTCCCAGCA GATCCACGAC
CTCATGTCAG GGCTCGGCGA CTCACCACTG CTCTTTATGC TGTTCACCAT CGTCCTGCTG
ATCATCATGG GCCAGCTCCT TGAAGGCCTC CCCGCGGTAC TGATCTTCGC GCCCCTGCTC
CTGCCGATCG CCGTCGACTT CGGCGTCAAC CCCGTGCAGT ACGCGATGGT ACTGATCATC
TCCATGGGTA TCGGCTCATT CGCACCGCCA GCCGGCGTCG GCTTCTACGT CGCCTGCGCA
ACCGCCCACG AAACTGTCGA GAAGAGCCTC AAGCACTTCT GGCCCTACCT CATCGCCGTG
TTCCTCGGGC TCCTCGTTCT TGCCGCAGTC CCGTGGTTCA GCACATTCCT TCCCGCCCTC
GCCGGACTGA TCCCCTTCTA A
 
Protein sequence
MSTVWKPMKN ITTPEELEEV LPSDAEEILH HGHVPPRWSG ALWLDKTLEW VVGAAILAEL 
VVILLNIMVR VVTGDSVLWT QEVSEIALLT IAFIGGAIAY PKGAHMSVQA LIMRLPATWK
PYLAALVDWL VFIMSAGAFA LFVPTLVQQI EEKTPILQLP VFWVSLPFSV GMVLIAWFAL
LKLWRQDRRP ALIAAGIAAG LIVLVLVAQP LFYYATPNVL LGVVLLLLFM LLFLGLPIAF
VLALASGIYL YLGGISEVSA IPIGMASGAK GFVLLAIPFF ILAGTVMNSA GLTLPLAKLV
DALIGHLRGG LLQVVVVTMY IFSGISGSKV ADVAAVGTTM RGMLEERKYP RGEVVAVLSA
SAIMGETIPP SIVLLILGSI TTISTTTLFL AGFVPAAFLA LCVMALVFFR ARKQGGVASP
KSSWRERGSA TFFAIPTLLL PVGMVVGILS GFATPTEVSS VAVAYAFILA AAYRRGSKRL
LGDTLRETTT TAGMVLFIIA AASPLAQTLA LAGVSQQIHD LMSGLGDSPL LFMLFTIVLL
IIMGQLLEGL PAVLIFAPLL LPIAVDFGVN PVQYAMVLII SMGIGSFAPP AGVGFYVACA
TAHETVEKSL KHFWPYLIAV FLGLLVLAAV PWFSTFLPAL AGLIPF