Gene Franean1_2326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2326 
Symbol 
ID5670724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2774786 
End bp2776063 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content69% 
IMG OID641241245 
Productmajor facilitator transporter 
Protein accessionYP_001506666 
Protein GI158314158 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACG TCCCCTCAAG CCCCGGCGGA CTTTCCCATC GGATGATCTT TCTGCTTGCC 
CTGTCCTGTG GCATCGGCGT GGCGAACATC TACTTCCCGC AGGCCATCAG CCCGCTCATC
GCGGACGGCC TGCACGTGTC CCGCAGCGCG GCGGCGACGG TCGTGACCGC CAGCCAGTTC
GGATACGCGG CCGGGATCTT CCTGCTGGTA CCCCTCGGCG ATCGGCTGGT GCCCCGCGGG
CTGCTCGTCA CCCTGCTGGT CGTGGTCAGC CTCGGGCTGC TCCTGGCCGG CACCGCGCAG
ACGCTGCCTG TGCTGATCAT CGCGAGCGGC CTGGTCGGCC TGACGACCGT CGTTCCGCAA
ATCATCATTC CGATGACCGT CGGGTTCATG CCGGAGAACC GGCGCGGCGC GGTGACCGGA
ACGTTGCTCA GCGGCCTGAT CGGCGGCATC CTGCTGGCCC GCAGTTTCAG CGGCGGGCTC
GGCCAGTGGC TCGGCTGGCG GGCACCCTAC CTGGTCACAT CCTCCCTTGT GCTGCTGCTC
GCTATCGTCC TGGCCGTTAC CCTGCCCACC ACGACTCCCT CGTCACGGGA GCGCTATCCA
GCTCTGCTGG CGACCTCCGT GCGGCTGCTG CGAACCGAGC CGGACCTGCG CCGCTCCTGC
CTGTATCAGG CGCTCGTCTT CGCCGGCTTC ACCGGAGCCT GGACCAGCAT CGCCCTGTAT
GTCACCGGAT CCACGTACAA GCTGGGAGCG TCAGAGATCG GCCTGATAGC CCTGGTCGGC
GCGGCCAGCA TGTTCTGCAC ACCGGTCGCC GGGCGCCTGG TCGACCGGCG CGGCCCCGAC
GTGGTCAACC TGGTCAGCAT GGTCGGCGCG ATCGCGGCGG CCGGGCTGCT CACCAGCGGA
CGCCTGGGAG GCGCGGTCGG GCTGGTCGGG TTGACGCTCG GCATGCTCGT GCTCGACGTG
GCCATGCAGT CGGGACAGGT GGCCAACCAG GCGCGGATCT TCGCGCTGCG GCCCAGAATG
CGCAGCAGGC TCAACACCGC CTACATGACC AGCGCCTTCC TCGGCGGCAG TGTCGGCTCG
TGGCTGGCTG TCCGCGCCTA CTACAGCCTG GGATGGGACG GCGTATGCCT CCTCGTCGCT
GTACTCGCAG CCCTCGCCCT GGCCCGTCAC CTGCCCGTGT TGCCCGGCCG GACCGCTTCA
CCCCGCTCGA CTGTGTCCGT TCCGCTGCCG CAGGATGCTC CGGCGTTCAG CGCAGGCAGC
GATCTGGCTC GCGACTGA
 
Protein sequence
MSNVPSSPGG LSHRMIFLLA LSCGIGVANI YFPQAISPLI ADGLHVSRSA AATVVTASQF 
GYAAGIFLLV PLGDRLVPRG LLVTLLVVVS LGLLLAGTAQ TLPVLIIASG LVGLTTVVPQ
IIIPMTVGFM PENRRGAVTG TLLSGLIGGI LLARSFSGGL GQWLGWRAPY LVTSSLVLLL
AIVLAVTLPT TTPSSRERYP ALLATSVRLL RTEPDLRRSC LYQALVFAGF TGAWTSIALY
VTGSTYKLGA SEIGLIALVG AASMFCTPVA GRLVDRRGPD VVNLVSMVGA IAAAGLLTSG
RLGGAVGLVG LTLGMLVLDV AMQSGQVANQ ARIFALRPRM RSRLNTAYMT SAFLGGSVGS
WLAVRAYYSL GWDGVCLLVA VLAALALARH LPVLPGRTAS PRSTVSVPLP QDAPAFSAGS
DLARD