Gene Franean1_7065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7065 
Symbol 
ID5675375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8621278 
End bp8622615 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content62% 
IMG OID641245910 
Productalpha-L-arabinofuranosidase B 
Protein accessionYP_001511301 
Protein GI158318793 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0253597 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACTGTC TGCGTCCGCG CGAACCGTCT GCGGCCGCCG CCACGGGGTC TACACTCAGG 
CTGACAGACA CAGAAAGTGG GTGCCTCCCT GCGGTCGGCA GGACATCCCG ATACACGATG
GCGGCGTGTA TCCAAGCCAG CGACAGGACC ATGTATTTGT ATGAGTCGTT GGACGCCACC
GGGTTCGGGC TGCTGAGGGA TTCCGGATAC ACACCCCCAT GTGGACTGAC TCGCGACCCC
AGCATCATGC GGCACACCGA CGGGATGTAC TACGTCGCCC ACAGCACTGC TCGGACCGGA
AATCAGATCG GCCTCGCCTC AACTACCGGC CGCGTCAACT GGACCTTCCT CGGCAACGTC
ACGTTGATGG CGTCCCGAGT AGGAAGTGTC TGGGCTCCCG AATGGTTCAT CGATGCCGAT
GGTCGCGTTA ACATAGTCGT CTCTCTGCGG GTCGGCGGCG GCGCACGGGG CGCTTTTCGG
CCGTACAAGA TAACGGCAAC GGACTCCTCG CTTGCCACAT GGAGCACTCC GACGCCGTTG
GTTGGCTTAC CTGCAGGTTG CAATGAATTG TTCATTGTCA GGGTCGGACC GACGTACCAT
GCATTCTATA GGAACAGCCG TTCGAAGCGT ATCGAGTTGG CCAGCGCGCC TAGCCTCACC
GGCCCATACA CGACCTGGAG ATCTCCGGAC TCTGTCGTTG GCGACGGCCG GGGGCCGGCT
CTGATACCCC TGGACAACGG CGGCTGGCGC CTGTACTTCG AGCACTGCAG GCGGGGCGGA
GTCTGGTTTA CCGACAGCTA TGACACCTTT CGCAGCTGGT CCGCACCGGT CGAGCTTCCG
GGACTGTCCG GTGTCGTGAA GCATCTGACA GTTGTGAAGG AAGTGGTCCC GGGCGGTGCG
ACCCTTCCTC TGGGCAGACT TTCGCTGAGA TCGGGCGATC TCGGGCACCA GTACTGGCGG
CTGGTCAACG ACGTCGTCTG TCTCGAGGTG GTCACGTCGG ACAGCAGCAC TGCTGTCCGA
CGAGAAGCGA CCTTCACCGT TGTTCGTGGC CTCGCCGATC CAAACGGTTT CTCGTTGCGC
GCATCCGACG GACGGTTCTT GCGACAGGGC AGATCCCGGC TGCATCTGAG CGGGTTTGGT
GCGACCGAGT CCTTCGACCG AGATGCCACG TTCGCGGTCC GACGTGGCTT AGGTGCCAGG
TCAGTGGCCC TAGAATCCTA TAGCCATCCG GGCCGGTATA TCCGCCGTCG CGGCCGAGAA
CTATGGCTTG AGCCTTATAA TGAGAGCTAT CGATTCCGCA CGGAGACCTC CTTCGGCATC
ACGGACGCCT GGTCCTGA
 
Protein sequence
MNCLRPREPS AAAATGSTLR LTDTESGCLP AVGRTSRYTM AACIQASDRT MYLYESLDAT 
GFGLLRDSGY TPPCGLTRDP SIMRHTDGMY YVAHSTARTG NQIGLASTTG RVNWTFLGNV
TLMASRVGSV WAPEWFIDAD GRVNIVVSLR VGGGARGAFR PYKITATDSS LATWSTPTPL
VGLPAGCNEL FIVRVGPTYH AFYRNSRSKR IELASAPSLT GPYTTWRSPD SVVGDGRGPA
LIPLDNGGWR LYFEHCRRGG VWFTDSYDTF RSWSAPVELP GLSGVVKHLT VVKEVVPGGA
TLPLGRLSLR SGDLGHQYWR LVNDVVCLEV VTSDSSTAVR REATFTVVRG LADPNGFSLR
ASDGRFLRQG RSRLHLSGFG ATESFDRDAT FAVRRGLGAR SVALESYSHP GRYIRRRGRE
LWLEPYNESY RFRTETSFGI TDAWS