Gene Franean1_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2900 
Symbol 
ID5671287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3414988 
End bp3416349 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content67% 
IMG OID641241807 
Productmajor facilitator transporter 
Protein accessionYP_001507227 
Protein GI158314719 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.95287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGCG ACCACGGTGG GGCGACACAG TCGAGCCCCG GGAACACGAG CGAGCTTCGT 
CGGGTGATCC TGGCCAGCTA CCTGGGCAGC GCCGTGGAGT ACTACGACTT CCTGCTCTAC
GTGACGGCGG CCAGCCTGAT CTTCAACGAC CTGTTCTTCA GCCAGCTCTC CTCGACCATG
GGGACGATCG CCTCCCTGGG AACGCTGGCC GTCGGCTACG CCGCGCGTCC GCTGGGAGCG
TTAATCTTCG GCCACTTCGG TGATCGGATC GGGCGCAAGT CAGTACTCAT CGTCACGCTC
CTCACGATGG GGATCTCGAC CGCGCTGATC GGGGTACTGC CCACCAGCGA GCAGGTCGGG
GCGCTTGCTC CCGCGCTACT GATCACGCTG CGCATCTTCC AGGGGATCTC AGTGGGAGGC
GAGTGGGGCG GCGCGGCGCT GATGACCTTC GAGCACGCCC CCGCGCACCG GCGCGGGTTC
GCGTCGAGCT TCGCCGGTGC CGGCGGGCCG ACCGGAACGG CACTGGCAGC CGGAATGCTT
GCCCTGTTCT CCCTGCTTCC CGATGAGCAG TTCGACACCT GGGGATGGCG AGTGCCGTTC
CTCTTCAGCG CCGTTATGGT CGGGATCGGC ATGTGGGCGC GTCTGCGTGT CTCGGAGTCG
CCGCTGTTCG TCGAGGAGAA GATCCGGCAG CAGCAGTCCG AGGAGGAGGT CGCCCCGCCG
ATCTGGCGGG TGCTCCGCTC CCCCATCGGC CTGCTCTCCG CATTCTTCGC GCTGCTGGCG
CCGTTCACCT TCAACAGCCT GGCCGGCTCC TTCGCACTCA CCTACTCGAA GGAGAACGGA
CTGCACGTGT CATCAGTTCT CAGCATCCAG GTGGTCGGCG CGGTGGTCTG CGTCGTCTGC
GAGATCGCTT CCGGCACTCT CTCCGACCGC TACGGGAGGC GTGTGATCAT GGGTTTCGGC
ATGCTCGCAG GAGCCCTCCT GACCTACCCA TTCCTGCAGC TGATTGGCTC AGGCCACTAC
GCGCCGACGA TGCTCGGCTT CGTGCTCGTG TACGGCCTGG TCATCGGGCC CATGTTCGGC
GTGTGCCAGG CATTCGTCAG CGAGCAGTTC GACACCGGCT CCCGTTACAC CGGGGCTTCG
CTGGGCTACC AGGCCGCCTC AACACTCGGA GGCGGGTTCG TGCCGATCAT CCTGGCGGCG
CTCCACGACT CGCGGGGCGG TGGCCTGGGC CAGATCACGC TGTTCGTGAT CGCGGTCGGA
TTTTTCGGCG TCGCGACACT GGTGGCCACG TCACGTCGTC GACGGATGCG CCCGCTCGCT
CCCCCGCTGC CGGTACCAGC CACCTCGGTC CTGGCGGACT GA
 
Protein sequence
MGGDHGGATQ SSPGNTSELR RVILASYLGS AVEYYDFLLY VTAASLIFND LFFSQLSSTM 
GTIASLGTLA VGYAARPLGA LIFGHFGDRI GRKSVLIVTL LTMGISTALI GVLPTSEQVG
ALAPALLITL RIFQGISVGG EWGGAALMTF EHAPAHRRGF ASSFAGAGGP TGTALAAGML
ALFSLLPDEQ FDTWGWRVPF LFSAVMVGIG MWARLRVSES PLFVEEKIRQ QQSEEEVAPP
IWRVLRSPIG LLSAFFALLA PFTFNSLAGS FALTYSKENG LHVSSVLSIQ VVGAVVCVVC
EIASGTLSDR YGRRVIMGFG MLAGALLTYP FLQLIGSGHY APTMLGFVLV YGLVIGPMFG
VCQAFVSEQF DTGSRYTGAS LGYQAASTLG GGFVPIILAA LHDSRGGGLG QITLFVIAVG
FFGVATLVAT SRRRRMRPLA PPLPVPATSV LAD