Gene Franean1_5309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5309 
Symbol 
ID5673643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6391333 
End bp6393675 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content77% 
IMG OID641244166 
Product4-alpha-glucanotransferase 
Protein accessionYP_001509573 
Protein GI158317065 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.242878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATCCG CGCCCGAGCG CCCCGACCAG CCCGCCGCGC CCCCCGCCGC TACCGCGCCG 
CCTACCACTG CGGCGCCGTC AGCCGCCGCG CCGTCAGCCG GCGGCGCGGT CGGAGCCGGT
GGGCAGCAGG GCCCGGCGTC GGTCTGGTCC GGGCTCGGTG ACCTCAAGGC CCTCGCCGCC
GAGTTCGGGG TGGCCACGTC CTACGACGGG CAGGACGGCA CGCCGGTCAC GGTGCAGCCG
AGGGCCGTCC GGGCGGCGCT GGGCCTGCTC GGCGTCGACC CGTCGGACCC GGCCGTCGCG
CTCGCCGGGG CACGCGAGGC GCGTCGGCGG CGCCCGCTGC CCCCGTGTGC GGTCGTCCGG
GCGCAGGCCC CCGCGCCGGT CGCCGTGCAC GTGCCGGATG CCGCCGCGGA CGCCGTCACC
GCCGAGGCCG TGCTGGCCGG CGGGGAGAGC GTGCCGCTGT CGGTCGGGCT GCGCGGCGCT
GTCGGGGAGG TCGACGGCCA TGCCGTCCGC GCGGGCACGG TGGACCTGCC GTCCGGCCTG
CCGCTCGGGG ATCACCGCCT GCGGCTGAGC TTCGGCGGGA GCACCACCGA GTGCCCGCTG
ATCGTCGTCC CGGAGCGCGT GCCCGACTTC GCGGCCGCCC CGAGCCCGGC CGAGGCCACC
GGGCGGGCCT GGGGCTGGAT GATCCAGCTG TACGCGCTCA CCTCCGCGGG ATCGTGGGGG
ATGGGCGACT ACGCCGACCT CGCCACCCTC GCCGAGTGGT CCGCGCGTGA CGGCGCCGAC
GTTCTGCTGG TGAACCCGCT GCACGCGGTG GCGCCGACCT TTCCGGTCGA GCCGTCGCCG
TACTCGCCGG CGAGCCGCCG CTTCGTCTCA CCGCTCTACC TGCGGCCCGA GCTGACCCCG
GAGTACCGGC ACGCCTCCGA GACGGTGCGG GCGGAGGTCG ACCGGCTGGC GGGAGTCGCC
CGCCGGGAGG GGATCAGGGA CGGCCTGATC GACCGGGACG CGGTGTGGCG GGCCAAGCTC
GCCGCCGTCG AGCTGCTGTT CACCTCGTCC GGCGGCGGGA CGGGCGACGG GCCGGCCGGT
GGGCAGGAGG CCGATGGCGC GCTGCGCGAC TTCGCGCTCT GGTGCGCGCT CGCCGAGCGG
CACGGCCGGG ACTGGCGCAC CTGGCCGGAG GACCTGCGCG ATCCCGCCGG GCCGGCGGTC
GACGCCGCGC GCGCCGAGCT GGCGGAGCGC GTCGCGTTCC ATGTCTGGCT GCAGCGGCGG
TGCGACGACC AGCTCGGCGC GGCGCAGGCC GCCGCCAGGA CGGCGGGGAT GCGGGTCGGC
ATCGTCCACG ATCTCGCTGT CGGGGTCGAT CCGGGCGGCG CGGACGCCTG GGCGATGCGC
GGCGTGCTGG CCACCGGGGC CTCCGTCGGC GCCCCGCCGG ACGGCTTCAA CCAGCAGGGC
CAGGACTGGG GCCTCCCGCC GTGGCGGCCC GACGTCCTCG CGGAGAGCGG GTATGCCCCG
TTCCGGGCGA TGGTCGCCGC GGTGCTGTCC CGGGGCGGCG GGCTACGGGT GGATCACATT
CTCGGGCTGT TCCGCCTGTG GTGGGTCCCG GACGGCGCCG GCGCCGCCGG CGGCACTTTC
GTCCGCTACG ACGCCGAGGC GCTGCTGGGG CTGCTCGCCC TCGAGGCGCA CCGGGCCGGC
GCCCTGGTCG TCGGTGAGGA TCTCGGCACC GTCGAGCCGT CGGTGGCCGA GGCGCTCGAC
GGCGCCGGGA TCTTCGGCTC CTCGGTGCTG TGGTTCGAGC AGGCGGCGGA CGGCTCCCCG
CTCCCGCCGC GCGAGTACCG GGCCCGGACC ATGGCCAGCG TGACCACGCA CGACCTGCCC
ACCGCCGCCG GCTTCCTCGA GGGCGAGCAC GTGCGCGTGC GCGCGCGGCT CGGCCTGCTC
GCCCGCACCG ACGAGCAGGA ACGCGCCGCC TGGCTCGCCG AACGCGCCGG ACTGCTGCGG
CTGCTCGCCG ACGAGGGCCT GGTGAGCCCG CCGGCGGGGG TCGTGGCGGA GGAGGATCGC
CTCGAACCGG AGCTGCGTGC GGCGGCCGCG CTCGGCCTGC ACGTCCTGCT TGCCCGGTCA
CGCGCGCGGA TCGTGCTGGT CGCTCCCGGT GACGCGTTCG GCGACGTCCG TCAGCCGAAC
CTGCCCGGCA CGGTCGACAG CTATCCGAAC TGGCGGCTAC CGGTCGTCGA CGACGCCGGG
GAGCGCGTCA CCGTCGAACG GCTGATCACC GATCCCCGGT CGCGCCGGAT GGTCGAGGCA
CTCGAGGCAC TCGGGGCGAT CACCGCCGAT CGGGCGGGAG CCACCACCAC CCGGCGCCCC
TGA
 
Protein sequence
MTSAPERPDQ PAAPPAATAP PTTAAPSAAA PSAGGAVGAG GQQGPASVWS GLGDLKALAA 
EFGVATSYDG QDGTPVTVQP RAVRAALGLL GVDPSDPAVA LAGAREARRR RPLPPCAVVR
AQAPAPVAVH VPDAAADAVT AEAVLAGGES VPLSVGLRGA VGEVDGHAVR AGTVDLPSGL
PLGDHRLRLS FGGSTTECPL IVVPERVPDF AAAPSPAEAT GRAWGWMIQL YALTSAGSWG
MGDYADLATL AEWSARDGAD VLLVNPLHAV APTFPVEPSP YSPASRRFVS PLYLRPELTP
EYRHASETVR AEVDRLAGVA RREGIRDGLI DRDAVWRAKL AAVELLFTSS GGGTGDGPAG
GQEADGALRD FALWCALAER HGRDWRTWPE DLRDPAGPAV DAARAELAER VAFHVWLQRR
CDDQLGAAQA AARTAGMRVG IVHDLAVGVD PGGADAWAMR GVLATGASVG APPDGFNQQG
QDWGLPPWRP DVLAESGYAP FRAMVAAVLS RGGGLRVDHI LGLFRLWWVP DGAGAAGGTF
VRYDAEALLG LLALEAHRAG ALVVGEDLGT VEPSVAEALD GAGIFGSSVL WFEQAADGSP
LPPREYRART MASVTTHDLP TAAGFLEGEH VRVRARLGLL ARTDEQERAA WLAERAGLLR
LLADEGLVSP PAGVVAEEDR LEPELRAAAA LGLHVLLARS RARIVLVAPG DAFGDVRQPN
LPGTVDSYPN WRLPVVDDAG ERVTVERLIT DPRSRRMVEA LEALGAITAD RAGATTTRRP