Gene Mflv_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMflv_1075 
Symbol 
ID4972401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium gilvum PYR-GCK 
KingdomBacteria 
Replicon accessionNC_009338 
Strand
Start bp1122388 
End bp1123728 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content67% 
IMG OID640455271 
Productgeneral substrate transporter 
Protein accessionYP_001132345 
Protein GI145221667 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.801307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGTTC AGCGGGATTC CGACATCCGC CGGGTGGTCA CCGGAGCCTC GATCGGTAAC 
GCCGTCGAGT GGTTCGACTT CGCCATCTAC GGTTTCCTCG CGACGTTCAT CGCCGCGCAG
TTCTTCCCGG CGGGCAACGA CACCGCCGCG CTGCTGAACA CGTTCGCGAT CTTCGCCGCC
GCGTTCTTCA TGCGTCCACT GGGCGGTTTC TTCTTCGGGC CGCTGGGTGA CCGGATCGGC
CGCCAGAAGG TGCTGGCGGT GGTGATCCTG CTGATGTCGG CCGCGACCCT GTGCATCGGG
TTGCTGCCGA CCTACGACGC GATCGGCGTG GCGGCCCCGC TGCTGCTGCT GGTTCTGCGT
TGCCTGCAGG GTTTCTCGGC AGGTGGCGAG TACGGCGGCG GCGCGGTCTA TCTCGCGGAG
TTCGCCAGCG ACCGGCGCCG CGGACTGACG ATCACGTTCA TGGCGTGGTC GGGGGTCGTC
GGGTTCCTGC TGGGATCGGT CACGGTGACG CTGCTGCAGG CACTGCTGCC CGCGGAGGCG
ATGGAGAGTT ACGGGTGGCG CATCCCGTTC CTGATCGCCG GGCCGCTGGG GCTGGTCGGC
CTCTACATCC GGCTCCGCCT CGGCGACACC CCGCAGTTCG CCGAACTGGC CAAGGCGGAC
AAGAAGGCAG AGTCCCCGCT GCGCGAGGCG GTCGCCACCT CCTGGCGGCA GATCCTGCAG
GTCGTCGGCC TGTTCATCGT CTTCAACATC GGCTACTACG TCGTCTTCAC ATTCCTGCCA
ACGTATTTCA TCAAGACCCT GGGATTCACC AAGTCCTACT CGTTCCTGTC GATCACCTTG
GCCTGCCTTG TCGCGCTCAT CCTGATCCTG CCGCTGGCCG CGCTGTCGGA CCGGTTCGGA
CGGCGCCCGC TGCTGATCGG CGGAGCGGTC GCGTTCATCG TGCTGGGCTA TCCGCTGTTT
CTGCTGATCA CCTCCGGGTC GCCGGTTGCC GCGATCACGG GGCACTGCCT GCTGGCCGCG
ATCGAGTCGG TCTACATCTC GTGTGCGGTG TCGGCCGGCG TCGAACTGTT CGCGACCCGG
GTGCGCTTCA GCGGCTTCTC GGTCGGCTAC AACATCTGCG TCGCGGTGTT CGGCGGCACG
ACCCCGTACG TCGTCACCTG GTTGACCGCC ACCACCGGCA ACTCGATCGC ACCGGCGTTC
TATCTGATCG CGGCGGCCGC CGTCTCGCTG GCCGCGGTCC TGACGCTGCG GGAAACGGCC
CGGCGCCCAC TCGCGCAGGT CCCATACAAT CGCGCAGATG AAGCTCCGGG TGTTGCCCTA
CGTCAGCATC GAGATCGATG A
 
Protein sequence
MEVQRDSDIR RVVTGASIGN AVEWFDFAIY GFLATFIAAQ FFPAGNDTAA LLNTFAIFAA 
AFFMRPLGGF FFGPLGDRIG RQKVLAVVIL LMSAATLCIG LLPTYDAIGV AAPLLLLVLR
CLQGFSAGGE YGGGAVYLAE FASDRRRGLT ITFMAWSGVV GFLLGSVTVT LLQALLPAEA
MESYGWRIPF LIAGPLGLVG LYIRLRLGDT PQFAELAKAD KKAESPLREA VATSWRQILQ
VVGLFIVFNI GYYVVFTFLP TYFIKTLGFT KSYSFLSITL ACLVALILIL PLAALSDRFG
RRPLLIGGAV AFIVLGYPLF LLITSGSPVA AITGHCLLAA IESVYISCAV SAGVELFATR
VRFSGFSVGY NICVAVFGGT TPYVVTWLTA TTGNSIAPAF YLIAAAAVSL AAVLTLRETA
RRPLAQVPYN RADEAPGVAL RQHRDR