Gene Francci3_4004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4004 
Symbol 
ID3906965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4787894 
End bp4789228 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content71% 
IMG OID637881333 
Productmajor facilitator transporter 
Protein accessionYP_483083 
Protein GI86742683 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGGCC ACGGCGCGGC CCGGCGAGAC GTACCAGAGC CCAGGGTGAC TCGTCCCCCT 
GTGGAGGCTT CATCCGCGTC GACGCGCGAG GTCCGACTCG ACGGGGCCCC GCCGAACTCG
ACAGACCACG GTGATGCGCC CTGTCCCCGG CCCGAGACGC GCGACCGGCA GCGGTGGGGC
GCGCTGATCA CCGGCTGGCT GGCACTGTTC GTCATCGGGA CCGATCTGTT CGTCGTCTCG
CCCCTGCTGC CGATCTGGCG CTCCCGGTTC GACGTGTCAC TGCGCACGGC CGGACTGACG
GTGACGGTCT TCGCCCTGGC CTACGTCGTC GCGGCACCCG CGTGGGGTCG GCGAGCGGTG
AAGGTATCGC CGCCCGCGGT GCTGGTTCTG GCCCTGTCCG GCTTTGCGGT GTGCAATGTG
TTGACCGCCC TCGCCCCGAA CTTCTGGACG TTGCTGGCCG CTCGGGTCGG CACCGGTCTG
TTCATCTCCG GCGCCACGGC CACCGTGTTC ACACTGGTCG CAAGCAGCGC CCCCAACGGG
CGACGGGCGT CGTGGCTGGG GATCGCCACC TCAGGGCTGC TGTCCGCGCT GTGGGCCGGA
GCCCCTCTCG GTGGCCTGGT CGCACGGCAC ACCAGCTGGC GGGCGGTGTT CGTCATCCTG
GCCGCATTCG CCGTCCTCAT TCTGGTTGCT GCCCGCAGAA TCTGGCCGGA CACACAGGCA
GCCCGCGCCG CCGCTGCGCC CGCCAGCCGC AGTGCCGCGT GGCGGGCCCG CGCCGTCGCG
CCGACCGCCT GCTGGGCGGC AGCCGTCTAC GGCCTGTACA CCTATCTGTC GGCTGGCCTC
GGTGACCGGC CCGGATGGTC CACCGCCTGG CTGAACGCAT CGTTGATCGT GTACGGGTTG
TGCGCTGTCG CCGCCACGTT CCTCGGCGGT CGGATCGCAG ACCGCCACGG CGCCGCCCGC
ACCACCTGGA CGGCACTCCT GCTGCTCGCC GCCGCCGACG TCGCGTTCTC CGCAAGTCTC
TCCAGCGCGG CGGTCACCAC CTGCCTTGCG ATCGCGTTGC TCGCGTTCGC CGCCTACACC
GCCTTTCCCG CCAAACAGGC TCAGCTCGTC TCCGACCATC CCGCCGACTC CGCCCAGCTG
ATGTCCTGGA ACCAGAGCGC CATGTACCTG GGGATCACCC TCGGCTCTCT GGCCGGCGGA
CGCATCGCCG ACGGCCACTT CCGCGCGCTG CCACTCGCAT GCGCGGCGGT CGCCATCCTC
GGCGCCACCA CACAGGTATC CACCGTCCGC CGCCGGACTC CGGCCGGTCA TGCAGCGGAT
AGTGACCGGT GGTGA
 
Protein sequence
MTGHGAARRD VPEPRVTRPP VEASSASTRE VRLDGAPPNS TDHGDAPCPR PETRDRQRWG 
ALITGWLALF VIGTDLFVVS PLLPIWRSRF DVSLRTAGLT VTVFALAYVV AAPAWGRRAV
KVSPPAVLVL ALSGFAVCNV LTALAPNFWT LLAARVGTGL FISGATATVF TLVASSAPNG
RRASWLGIAT SGLLSALWAG APLGGLVARH TSWRAVFVIL AAFAVLILVA ARRIWPDTQA
ARAAAAPASR SAAWRARAVA PTACWAAAVY GLYTYLSAGL GDRPGWSTAW LNASLIVYGL
CAVAATFLGG RIADRHGAAR TTWTALLLLA AADVAFSASL SSAAVTTCLA IALLAFAAYT
AFPAKQAQLV SDHPADSAQL MSWNQSAMYL GITLGSLAGG RIADGHFRAL PLACAAVAIL
GATTQVSTVR RRTPAGHAAD SDRW