Gene Arth_3624 details

Gene Information

Locus tag: Arth_3624
Symbol:
ID: 4443935
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 4063229
End bp: 4066129
Gene Length: 2901 bp
Protein Length: 966 aa
Translation table: 11
GC content: 71%
IMG OID: 639691448
Product: ABC transporter related
Protein accession: YP_833099
Protein GI: 116672166
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
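
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4063229
end_bp = 4066129
gene_length_bp = 2901
protein_length_aa = 966

# Assumption: coordinates are inclusive, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which is not counted
# in the protein length.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # True
print(computed_protein_length == protein_length_aa)  # True
```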


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTCC GACCCGCGCC GCTACGTGCA GGGGCGGCCC TCGCCGTCGT CTTCATCGTG 
GCGCGGGTCA TCTACCGCGT CCTTTTCAAC GGGGCAGGCA TCGCCGAGCC GGTTCTGCTC
GACCTGCCGG CGATTCGGCT GCCTGCCCCG TACGCCCATG TGGTCCTCCT CGGCCCGGTC
ACGGCGCCCG GACTGTGGGC GGCGGTCCTG TCCGCCCTGC CGATTGCAGG AATGTTCCTT
GGTTTCGGCC TGCTCAACGC CTGGGTGGAC GTGGCCCGCG GCTTTGTGCA CCTGGCCCGC
CGCGGCCCCA TGCAGGGCTT GGCCCGGACG CTTGTGGTGG CCTGGGCGGC ACTCCCCGCG
CTTTCCGACG CCGTGACCTC CGTGCGCCTG GCATTCCGGT TGCGGGGCGA ACGCTTCGGG
CCCCGAGCCC TTGTTCCCGT GCTGGAGCGC ACCCTTGAGC ACGCGGCCCG GGTTGCCGCG
GCCCTGGAGC TGCGTGGATT CGGGAGCCGG GCGGCACCCC AGCCCGGCAG CGGTACCGAA
ACACCGCTTT TCGTCCGGAA TGCCGAGTTC CGCATTGGTG ACGCGCAGGT GCGCGTCACG
GAATTCACGC CGTCTAGCGG GTCCATGACG GTGATCACCG GGCCCACGGG CTCCGGCAAG
TCCACTATCC TGCGAGGTAT CGCGGGCCTC CTCTCCCACG TTGACGGCGG AGAAATTACC
GGGACCGTGC GCGTCGCCGG AGCGGACCGT TCCGCCACGC CGCCGCGCGA TACCGCCCGC
CTCGTCGGCG TCGTATTGCA GAATCCCCGC GCCGCATTCG CCACCACCCG GGTCCGGGAC
GAAATCGCCC TGGCCCTTGA GCTGCGCGGC ATGGCATCCG GTGCCGCCAA GGCCCGGGTG
CTGGAGATTG CGGAGAGCAT TGGAGTCTCG GCCCTGCTGG ACCGGAATGT CAGCACCCTC
TCGGCGGGCG AGGCGACGCT CGTTGCCATC GCCGCTGCCG TGGTGGAGCA GCCTGCCCTC
CTCCTGGTTG ATGAGCCCCT GGCCGACCTT GACACCGCAG CACGCGGACA CGTCATCGCG
GTGCTTCACG CCCTCGCCCG GGACGCCGGT GTCTGCGTCA TCGTGGCGGA GCACCGAGCG
GAGCCGCTCG TCCCGGTCGC CGATTCGTGG TGGACTATCG ACGACGGCGA CCTGGTGCCG
GGTGCCGCAC CATCCCCGCC GCCGCGCCTT GTCGACGCCG GTCCAATGCC GGCAGAGCCT
GCGGACTTCG CGCCCGTACT GACGGCGACA CAGCTTGCGG TGCACCGCAA GGGCACGCCG
CTGGTGCGCG ACGCATCCCT GACCCTGCAC CGCGGTGAAG TGGTGGCCCT GGTGGGGCCG
AACGGGGCCG GGAAGTCGTC CCTCCTGGTG GCGCTAGCCC TTGGTGAGGG CATGGTGAGT
GAAGGAACTG GCAGTGCAGG AAAGCGCGAC GTCGGCACGG TGGACGGCGG AAAGGCCGAC
GGCGGGCGCG TCGCCCTGGT CCCGGACGCC TCGGACGACC TCTTCACGCG GGATACCGTC
GCCGGCGAGC TCCGCGCGGC AGAGCGCCGC CTTGCGCGCC GAAAGAACGG CGGACAGCCC
GCGCCCGGCT TTGCGGCGTC GCGCCTGGCC CGTCTGCGGG GCGACGTCAC GATCCCCATT
GGCCAGGAGC ATCCGCGGGA CCTCTCTGCG GGGGAACGCA GGATCCTGGC GATCGTGCTC
CAGACCATGG ACGACCCCCG GGTGCTCCTG ATCGATGAGC CGACCCGCGG ACTGGATCCC
GCGGCGCGCA CGGCAGTCTC GGCGGCGCTC CGGGCTGCAG CGGATTCCGG TGCGGCGGTC
CTGATCGCCA CGCACGACCT CGACTTTGCC CACAGCCTTG GCGCCCGGAT CCTCCCGATG
CATGACGGCG TCGCGCCCTC CTCTGCGGCA GCCGATGTAC CTGAACCGCC CCTTCCGCTC
CCCCGGACGG CGGGCGCCGG ACCAAGGCCG GAGGTCATCG AACCAGACTC GAAGGGCGCA
AAAAGACGGC GCCGCATCCG GATGCCACGG GGCATCGAGC TCGCCGTCCT CGCAGCTGCA
AACCTTCTGG CCCTCGCAGC ATTCTGCTGG CCGCTGCTCG CCGCGGCCTT CCCCGAGGAT
GCCGCCGCGG CACTCCCCTA TGCGGCGCTG GCCATTGCGC CGATCGCCGT CGTCGCCATT
GTCGTGTCCC TTGACGGCTC GGTCCGCTCC GCACACACGG TGGCGCTGCT CGGCGTCCTG
GCGGCAGTAG GTTCGGCGGT CCGGGTGGCC AGCACCGGCG TCGGGGGTGT AGAGGCAGTC
TTTATCCTGC TGATCCTGGC CGGCCGGGCC TTCGGCCCCC GTTTCGGCAT GCTCCTCGGC
GCCGCCACCA TCGCCCTCTC CAGCGCCCTG TGGGGTGGCA TCGGGCCGTG GACGCCGTTC
CAGATCTTTG CGTGTGCGTG GGTGGGCGCC GGGGCCGGCC TGCTCCCCCG CCGGGTGCGG
GGCAAAGCCG AGCTGTGGAT GTTGTGCGGC TACGGAGTCG TGGCGTCCTA CCTGTTCGGC
CTGCTGACCA ACCTGTGGTT CTGGCCCTTC GCGGTGGGCG CCGGCACCGG CATCTCCTAC
GTGCCCGGCG CACCGCTGGG CACCAACCTC AGCAGCTTCC TGCTCTACTC GCTGTTGACG
TCGACGGCGG GCTGGGACAC CCTGCGTGCC ATCACCACCA TCGTCGGAAT CGCCGTGGTG
GGGCGAGCCA TTCTCGCCGC GCTCCGGCGG GTGAAGCCGG TCTCCGGCGC GGTTCCTGGA
CCCGGCGGGC AGGCCGTGCA GGCTCAATCC GCCCAGGCTA AATCCACCCA GTCCGAGGAC
CGGCTACACA TTGGAGTTTG A
 
Protein sequence
MTFRPAPLRA GAALAVVFIV ARVIYRVLFN GAGIAEPVLL DLPAIRLPAP YAHVVLLGPV 
TAPGLWAAVL SALPIAGMFL GFGLLNAWVD VARGFVHLAR RGPMQGLART LVVAWAALPA
LSDAVTSVRL AFRLRGERFG PRALVPVLER TLEHAARVAA ALELRGFGSR AAPQPGSGTE
TPLFVRNAEF RIGDAQVRVT EFTPSSGSMT VITGPTGSGK STILRGIAGL LSHVDGGEIT
GTVRVAGADR SATPPRDTAR LVGVVLQNPR AAFATTRVRD EIALALELRG MASGAAKARV
LEIAESIGVS ALLDRNVSTL SAGEATLVAI AAAVVEQPAL LLVDEPLADL DTAARGHVIA
VLHALARDAG VCVIVAEHRA EPLVPVADSW WTIDDGDLVP GAAPSPPPRL VDAGPMPAEP
ADFAPVLTAT QLAVHRKGTP LVRDASLTLH RGEVVALVGP NGAGKSSLLV ALALGEGMVS
EGTGSAGKRD VGTVDGGKAD GGRVALVPDA SDDLFTRDTV AGELRAAERR LARRKNGGQP
APGFAASRLA RLRGDVTIPI GQEHPRDLSA GERRILAIVL QTMDDPRVLL IDEPTRGLDP
AARTAVSAAL RAAADSGAAV LIATHDLDFA HSLGARILPM HDGVAPSSAA ADVPEPPLPL
PRTAGAGPRP EVIEPDSKGA KRRRRIRMPR GIELAVLAAA NLLALAAFCW PLLAAAFPED
AAAALPYAAL AIAPIAVVAI VVSLDGSVRS AHTVALLGVL AAVGSAVRVA STGVGGVEAV
FILLILAGRA FGPRFGMLLG AATIALSSAL WGGIGPWTPF QIFACAWVGA GAGLLPRRVR
GKAELWMLCG YGVVASYLFG LLTNLWFWPF AVGAGTGISY VPGAPLGTNL SSFLLYSLLT
STAGWDTLRA ITTIVGIAVV GRAILAALRR VKPVSGAVPG PGGQAVQAQS AQAKSTQSED
RLHIGV
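
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11 (which, for elongation codons, match the standard code) and is applied here only to the first and last lines of the gene sequence as printed above; the function and variable names are hypothetical.

```python
# Compact codon table: amino acids listed for codons in T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code;
# its alternative start codons are not handled (this gene starts with ATG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Each full sequence line holds 60 bases, so every line starts in frame.
first_line = "ATGACCTTCC GACCCGCGCC GCTACGTGCA GGGGCGGCCC TCGCCGTCGT CTTCATCGTG"
last_line = "CGGCTACACA TTGGAGTTTG A"
print(translate(first_line))  # MTFRPAPLRAGAALAVVFIV
print(translate(last_line))   # RLHIGV
```

The two outputs correspond to the start and end of the protein sequence listed above.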