Gene Franean1_4234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4234 
Symbol 
ID5672589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5041925 
End bp5043133 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content75% 
IMG OID641243107 
Productsugar transporter superfamily protein 
Protein accessionYP_001508524 
Protein GI158316016 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.350034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCAGCC GGCTGACCGA CCTCGTCGTC TACCGGTTGG GCGGGAACGT CCTGCTGCAC 
GGCGGGGACC TCTACTCGGC GGTGGAACCC GCCTCGGGGC TGCCGTTCAC CTACACACCC
TTCGCGGCGG CGGTGTTCAC GCCGCTCGCA CTGCCGCCGC GGCCGGTCGC GCAACTGGTC
TGGACGCTGC TCCTGTTGGT CTCGCTGTAC TTCTTCTGCG CCACCTCGCT GGCGGCGACC
CGGCCGTCCC CGGCCGGCCG CTGGCCCGGG GTCCGCGCCG TCGGCCCGGT CGTCGGCCTG
GCGATGCTGG CCGAGCCGAT CCGGCGGAAC TTCGACCTGG GCCAGATCAA CATCGCGCTG
GGGCTGCTCG TCGCGCTGGA CCTGTTCGGC CGGCGCGGGC CGCTGCCGCG GGGTGTGCTC
ATCGGGATCG CCGCCGGGAT CAAGCTCACC CCACTGATCT TCGTGCCGCA TCTGCTTCTC
ACCGGCCGCC ACCGCGCCGC CCTGACCGCG GTGGCGACGT TCGGCGTGAC GATCGGCGTC
GGCTTCGCGG CGAGCCCCGG CTCGTCGGCG ACGTACTGGT CCGAGACGTT CCTCGACACC
GGCCACGTGG GTGGCGTCCC GTTTGCCGGC AACCAGTCAC TGCTCGGCGT CCTGGCCCGC
CTCATGTCCG GCGCGGACAA CGCGCGCCCG CTCTATCTCC CGCTGGCCGC GGTGGTCGCC
GCCGTCGGTC TGGCGACGGC GGCGCGGCTG TTCCGCTCGG GTGCGCGGCT GCCCGGGGAC
GTCACCTGCG CTCTGACCGG CCTGCTGGTG TCGCCGATCT CGTGGAGCCA TCACTGGGTG
TGGGCGGTCC CCGCGCTTCT CTGGATGTTC GCGGCCCCCG GTCGGCCGCG GTGGGGACGG
GGCGCCGCGG TGGCCGGCTA CGGGCTGCTC ATGGCCGCTC CGATCTGGTG GGTCCCGAAC
ACGGGCGACG CCGAGTTCTC CCACCACGGC TGGCAGCTGC TGGCCGGGAA CTGCTACTTT
CTCGCCGCGC TGCTCCTTCT CGGGGCGCTG GCGGCGTGGG CGCCGACGGG GGCCCGCGGC
GGGTCGCGGG CCCCCGTGCC ACGGCTGCCC GACCAGCGGA TCGTCGGGCG GGGTGTGCCC
GGGCCGCGGG CCCCTGACCC GCGGCGGCGA TCAGGAGATG TGGCCGGCGC CCTTCCCGGA
GGCGAGTAG
 
Protein sequence
MTSRLTDLVV YRLGGNVLLH GGDLYSAVEP ASGLPFTYTP FAAAVFTPLA LPPRPVAQLV 
WTLLLLVSLY FFCATSLAAT RPSPAGRWPG VRAVGPVVGL AMLAEPIRRN FDLGQINIAL
GLLVALDLFG RRGPLPRGVL IGIAAGIKLT PLIFVPHLLL TGRHRAALTA VATFGVTIGV
GFAASPGSSA TYWSETFLDT GHVGGVPFAG NQSLLGVLAR LMSGADNARP LYLPLAAVVA
AVGLATAARL FRSGARLPGD VTCALTGLLV SPISWSHHWV WAVPALLWMF AAPGRPRWGR
GAAVAGYGLL MAAPIWWVPN TGDAEFSHHG WQLLAGNCYF LAALLLLGAL AAWAPTGARG
GSRAPVPRLP DQRIVGRGVP GPRAPDPRRR SGDVAGALPG GE