Gene Franean1_3988 details


Gene Information

Locus tag: Franean1_3988
Symbol:
ID: 5672348
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4772891
End bp: 4774339
Gene Length: 1449 bp
Protein Length: 482 aa
Translation table: 11
GC content: 68%
IMG OID: 641242866
Product: major facilitator transporter
Protein accession: YP_001508283
Protein GI: 158315775
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
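
The start, end, and length fields above are mutually consistent if the coordinates are read as 1-based and inclusive, and the protein length follows from the gene length once the stop codon is subtracted. The sketch below checks that arithmetic. The function names are hypothetical, and the 1-based inclusive convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(4772891, 4774339)
print(length, protein_length(length))  # 1449 482
```

Here 1449 bp gives 483 codons, and removing the terminal stop codon leaves the 482 residues reported.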


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.958906
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGATAG CCGCGCACCC TTCGGCCGAG CCGGCCCCGA CCACAGCTGA CCCGCTGCGT 
TGGCGGGCAC TGGCCGCGAT AGCCGCCGGC CAGCTCATGA TTGCGGCAGA CGTCACGATC
ATGAATATCG CACTGCCGTC CGCCCAGCAC TCCTTGCACC TGTCAACGGC GCAACGACAG
TGGGTGATCA CCCTCTTCGC TCTCGCCTAC GGCGGGTTCC TCCTGCTCGG GGGCAGACTG
TCCGATCTGA TCGGTCGCAA GCGCTGCCTG CTGATCGGAC TGGCGGGTTT CGCCGCAGCC
TCGGCGCTGG GTGGGGCAGC CGTGAACCCA ACCATGCTCC TGGTTGCTAG AGCGCTCCAG
GGCATATTCG GGGCGCTGTT CACACCCTCC GCCCTCGCGC TGCTCGGTAC GACATTCACC
GAACCCTCCG AACGCGGCAA AGCCTTCGGG ATCTACGGCA CCGTGATGGC GGGCAGTTCC
GGCATCGGAC TGATCCTCGG CGGCGTCCTC ACCAACTACC TCGACTGGCG CTGGTGCATG
CTCGTGAGCC TGCCCATCGC GGTCGGCGCT GCCGCCGGAG TCAGCGCGAC GGTTCGCGCG
ACCCCCCGCC GGCTCGGCAC CGAGGTAGAC ATCGTCGGCG CGGTGCTCGC CACAACCGGG
CTCATGGCAC TGGTCCTTGG ATTCACCCGC GCGGAGTCAC AAGGCTGGGC CACCCGGATC
ACGCTGGGCG TTCTTGCCGC CGGAGTCATC CTTCTCGCGC TGTTCGTCCT AGTGGAAAGC
CGCACCGGAG CGGCCCTCCT GCCGCTGCGG GTCGTCCGTG AGCGTCGACG AGCCGGTGCG
TACCTGGCCG TCCTGTGCAT GGCGATCGGC ATGTTCGCCG GATTCTTCTT CCTCACCTTC
TACCTGCAGG ACATCCTCGG ATACTCACCG ATCAAGGCAG GACTCGCGTT CCTCCCGTTC
ACTGCGGCGA TCATGCTAGG AGTACGCGTC ATCCGCGGGT TCCTGATGCG CGCACCCCTG
CGGCTGCTGC TGTGCCCGGG TCTCCTGGCA TGCGCGGCCG GACTCGCACT GCTCGGCCTA
CTACGCGCCG ACGGCGGCTA CGTCACCGGG GCGCTTCCCG TCGTCGTGCT GCTCGGACTC
GGTGTCGGCT GTGTGCTGCT GCCTGCCAAC AACATCGCGA CTCTCGGCGC GGGCCCGGAC
ACCGGCGTCG CCGGCGCCAT CGTGATGACC TCCCAACAGA TCGGCGCCTC GCTCGGCACC
GCCCTGCTCG GCAGCATCGC CGCTACCGCC ACCACCGCCT ACGTCCACTC GCACGCCGCC
GCGGCCGACC TCCCCGCACG GGCCGCGGTG CACGGCTACA ACGTAGCCGG CCTCTCCGGC
GCCGCCTTCC TGTGCCTCGC AACGACCCTG GTGTTCCTCC TTACCGGTCC GAGGAACCCC
AACCAATAA
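
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that step. `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence as printed above (6 blocks of 10 bases).
first_row = "ATGCGGATAG CCGCGCACCC TTCGGCCGAG CCGGCCCCGA CCACAGCTGA CCCGCTGCGT"
seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(round(gc_fraction(seq) * 100))  # GC percentage of this row only
```

Passing the full 1449 bp sequence through the same two functions gives the gene-wide figure to compare with the reported GC content.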
 
Protein sequence
MRIAAHPSAE PAPTTADPLR WRALAAIAAG QLMIAADVTI MNIALPSAQH SLHLSTAQRQ 
WVITLFALAY GGFLLLGGRL SDLIGRKRCL LIGLAGFAAA SALGGAAVNP TMLLVARALQ
GIFGALFTPS ALALLGTTFT EPSERGKAFG IYGTVMAGSS GIGLILGGVL TNYLDWRWCM
LVSLPIAVGA AAGVSATVRA TPRRLGTEVD IVGAVLATTG LMALVLGFTR AESQGWATRI
TLGVLAAGVI LLALFVLVES RTGAALLPLR VVRERRRAGA YLAVLCMAIG MFAGFFFLTF
YLQDILGYSP IKAGLAFLPF TAAIMLGVRV IRGFLMRAPL RLLLCPGLLA CAAGLALLGL
LRADGGYVTG ALPVVVLLGL GVGCVLLPAN NIATLGAGPD TGVAGAIVMT SQQIGASLGT
ALLGSIAATA TTAYVHSHAA AADLPARAAV HGYNVAGLSG AAFLCLATTL VFLLTGPRNP
NQ
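
The protein sequence can be related to the gene sequence by codon-wise translation. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as start codons, so the standard assignments suffice for this gene, which starts with ATG. The sketch below builds the codon table from the usual TCAG ordering. The `translate` helper is hypothetical and does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Whitespace is ignored so the blocked layout can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


print(translate("ATGCGGATAG CC"))  # MRIA  (first four codons of the gene)
print(translate("AACCAATAA"))      # NQ*   (last three codons, ending in stop TAA)
```

The two fragments match the first four and last two residues of the protein sequence, with the trailing `*` marking the stop codon that is not counted in the protein length.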