Gene Arth_1836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1836 
Symbol 
ID4445630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2060331 
End bp2061653 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content62% 
IMG OID639689654 
Productmajor facilitator transporter 
Protein accessionYP_831326 
Protein GI116670393 
COG category[R] General function prediction only 
COG ID[COG2270] Permeases of the major facilitator superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.860837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTCGC GTGAAACGCA TGCTGCAGTT CCCGGCGCCT TGGCGCCGGC GTTGGCAGTG 
GAAGCGGAAC TTCCCGCTCC GGCCCTTGAC GCCCCCAAGG TCAGCGGCCG CTACATCTGG
CTGATGGTGC TCGCACAGTT TGGTGTGTTC GTAGCATTCA TCACGCCTCT GGCTATTTCC
CTGGCGATCA GGGTGAACCA GTTGGCGCCA ACTAACCAGG AATATTTGGG TTACATCACC
GGGGCAGGTG CGTTGGCCGT GATGGTTACC AGCCCGTTCC TTGGCATGGT CAGTGACCGC
ACCAGGACCC GGATCGGGCG CCGACGCCCC TTCATGATCG CCGGAACACT GTTGGGAGTC
ATCTCGCTCC TGGTCATGGC ATATGCCCCC AGCGTGCTGA TCCTTGGAGC CGGCTGGATC
CTGGCCCAAC TGGGGTGGGG TCAGGTACTG AGCAATCTCC AGATTTCGAC GGCGGACCGG
CTGCCTGAAT CCCAGCGGGG CAAGGTGGCA GGCCTCACGG GCTTCTCCAC CCAGGTTGCA
CCCGTTTTCG GCGTCGTTAT CGCCGGCGGT TTCGCAGCTG ACCCACTGCT GCTCTTCATG
GTTCCCGGTG TGGTCGGTGT GCTTCTCGTA GCACTGTTCG TCCTGTTCGT TCACGAGGCT
GATAGCCGCG GCATGGTTTT CTCCGCAAAA ATGACGCCAG CGTCAATGCT CCGCAACTAC
CTCTACAACC CCAGCCAGTA CCCAGATTTC TCCTGGAACT GGCTGGGACG GTTCTTCTTT
TACTTCGGCC TGACGCTGAA CACGACGTTT AGTGCCTTCT TCTTCGCGAG CCGGCTCGGT
ATCCCCGTGG AACAGGTAGG CGGGATCATC GCCACGCTCG GCGGTGCCGG CGTCCTGGCC
ACAACTGCGG GCGCGCTGGG TGGGGGCTTC CTGTCCGACC GTCTGCGGCG CCGCCGCCTG
TTCGTGCTCT TCGGCGGGCT CTTGATGGCT GCCGGGATGA TCTTGATGGC CTTTTCGTCT
GATCTCCCTC TCCTCATTGC CGGTTCCTTG GTCACATCGA TGGGTATCGG GATGTTCTCA
GCGGTGGATC AGGCCCTGTT GCTGGACGTT CTCCCTGAAA AGGCCACCGA CGCCGGGCGC
TTTATGGGGA TCACCGGTTT CGCGACCTCT ATCCCACAGT CGGCCGCACC CCTGATCGCC
CCGATCTTCC TTGCTATCGG TGCCGCTGGA GACCAGAAGA ACTACACCCT GCTATTCGTG
GTCGCGGCAG GTTTCGTTCT GCTCGGTGGC GCCGTCATCA TGCGGATTCG CTCCGCACGC
TGA
 
Protein sequence
MNSRETHAAV PGALAPALAV EAELPAPALD APKVSGRYIW LMVLAQFGVF VAFITPLAIS 
LAIRVNQLAP TNQEYLGYIT GAGALAVMVT SPFLGMVSDR TRTRIGRRRP FMIAGTLLGV
ISLLVMAYAP SVLILGAGWI LAQLGWGQVL SNLQISTADR LPESQRGKVA GLTGFSTQVA
PVFGVVIAGG FAADPLLLFM VPGVVGVLLV ALFVLFVHEA DSRGMVFSAK MTPASMLRNY
LYNPSQYPDF SWNWLGRFFF YFGLTLNTTF SAFFFASRLG IPVEQVGGII ATLGGAGVLA
TTAGALGGGF LSDRLRRRRL FVLFGGLLMA AGMILMAFSS DLPLLIAGSL VTSMGIGMFS
AVDQALLLDV LPEKATDAGR FMGITGFATS IPQSAAPLIA PIFLAIGAAG DQKNYTLLFV
VAAGFVLLGG AVIMRIRSAR