Gene PC1_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPC1_3901 
Symbol 
ID8134888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePectobacterium carotovorum subsp. carotovorum PC1 
KingdomBacteria 
Replicon accessionNC_012917 
Strand
Start bp4388115 
End bp4389365 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID644867208 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003019452 
Protein GI253690262 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTGA GTGGGCAGGT GGCGTTCATC ATCCACTTTA TGTTTGTCGT ACAACTGGTG 
GCGATGGGGG CGATGGAGAT GAGCGGGCCG TTTTGGCCGC TGCATCTGGA AAGTATGTCG
TCCGGTGCGG AACTGAGTAT CGCGGGGATT GCCGTGTACA TCGGGCCGAT GCTGGGCATT
ATGCTGACCA GCGCCTTCTG GGGACGAATG GGCGATCGGC TGGGCAATAA AGCCATGATG
ATCCGCGCGC TGTTCGGGCT AGCGTTAACC CAGCTTGGGC TGGCGTGGGC CAATGACATC
TGGACGATCG TCGCGCTGCG TTTTATTCAG GGCGCCTGTG CGGGGTATAT CGCGCCCGCG
CAGGCCTACG GTGTCGCGGT CGTCAGTCCG TTACAGCGTA CGCGGCTGTT CGCCTGGCTT
CAGGTGTCTA CTAACGTGGG ATCGCTGCTG GGGGCGATTG TCGGCGGGCT GATCCTCGAC
TACCTGAACT TCTTCTGGAT CAACCTGAGC GCCGCGATCC TGTGCGCGCT GTGTGGCATT
ACCGTGGCGC TGTTCCTGCC GCATGTCGCC CCCGATGTCC CTGCGGTTCC GCCTGCGGAT
GCACAGGAGA AAAGCACACC GCGCAGTCGG CTTTGGGCGC TGTCGCCGAT TTCCGGCCTG
CTGCTGATTT CCGGCCTGTT GCTGGCCAGC CGGATGATTC CGCAAACGCC GTTTTCCCTG
TATATGGATG GCATTTTTCA GGTGGATAAA TGGATTATCG GCCTGTGCTA TGGCTTGCAG
GCGACCGGTG TGATTGTTTC TGCATCGCTG TGGGCGCGCT ATTTTGAAAA CCTCTCGCTG
TCGCAGACGC TGAGCCGCTT GTGTGTGGTT ATGCTGGCCT GCGCCATCGT CACATTGACG
GCCGCCACGA TCCTGAATAT CGCGATTTTC ATCCCACTTT ATTTCCTGTG GGGCGTCCTG
CTGGGGGCGA CGACGCCGGT TCTGATGGCG CTGATTTCTC GTGCGGCTGG TGCCGGACAG
CAGGGTTACA TACTCGGTGT GGCGCAAAGC GTCAGCCAGT TTGCCTCGAT TCTGGGCATT
GCTTTGGGCG GATTGGTGCT CTACTCCCCC GGACTACGTT CGCTATTCTT CTGCGTTGGT
GCCGCGTATC TGGTGACCTT CCTGGTCTCG CTGATGCTGC TACGACACCT GCGGAAACAG
GCGGAAAAAC ATGGCTCTCT CTCGACGAAG GGAAATATCG AAAATGTGTA A
 
Protein sequence
MRLSGQVAFI IHFMFVVQLV AMGAMEMSGP FWPLHLESMS SGAELSIAGI AVYIGPMLGI 
MLTSAFWGRM GDRLGNKAMM IRALFGLALT QLGLAWANDI WTIVALRFIQ GACAGYIAPA
QAYGVAVVSP LQRTRLFAWL QVSTNVGSLL GAIVGGLILD YLNFFWINLS AAILCALCGI
TVALFLPHVA PDVPAVPPAD AQEKSTPRSR LWALSPISGL LLISGLLLAS RMIPQTPFSL
YMDGIFQVDK WIIGLCYGLQ ATGVIVSASL WARYFENLSL SQTLSRLCVV MLACAIVTLT
AATILNIAIF IPLYFLWGVL LGATTPVLMA LISRAAGAGQ QGYILGVAQS VSQFASILGI
ALGGLVLYSP GLRSLFFCVG AAYLVTFLVS LMLLRHLRKQ AEKHGSLSTK GNIENV