Gene Franean1_5314 details

Gene Information

Locus tag: Franean1_5314
Symbol:
ID: 5673648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6398904
End bp: 6400625
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 75%
IMG OID: 641244171
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_001509578
Protein GI: 158317070
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0702] Predicted nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:
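
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(6398904, 6400625)
print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 1722 bp and 573 aa, matching the reported fields.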


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.492439
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCATAC TCGTCACCGG TGCGACCGGG TATATCGGTG GTCGGCTGGC ACCCCGCCTG 
CTCGGCCAGG GCCATCATGT CCGCTGCATG ACCCGTGATC CGGCGCGGCT GTCCGACGTC
GGCTGGGCCC GGCATCCCGA CATCGAGGTG GTCCGTGCCG ACGCCCTCGG CCCGGAGTCG
CTGCGCGCGG CGATGGAGGA TGTCGACGTC GCCTACTACC TGATCCACTC GATCGACACC
GGCGGCGACT TCTCGGCCGT GGACCGCCGG GCGGCCGCGG CGTTCGCGCG GGCGGCGCGG
GCCGCCGGCG TCCGGCGGAT CATCTACCTG GGTGGGCTGA GCCCGGCCAC CGGCATCGGC
ATGTCGGCGC ACCTGGCGTC CCGGCAGGAG GTCGCGCGGA TCCTGCTCGA CTCCGGGGTG
CCCACGGTCG TGCTGCGCGC CGCGATCATC ATCGGCAGCG GCAGCGCGTC GTTCGAGATG
CTGCGCCACC TCACCGAGCG GCTGCCGGTG ATGCTCACCC CACGCTGGGT GCGGACCCGC
ATCCAGCCGA TCGCCGTCCG GGACGTCCTG CACTACCTCA TCGGATGTCT CCAGCTGCCG
GCCGAGATCA ACCGCTCGTT CGACATCGGC GGGCCGGACG TCCTGACCTA CGCCGACATG
ATGCAGCGTT TCGCGGCGGT CGAGGGCCTG CGTCGCCGAG TGATCATCCG TGTGCCGGTG
CTCTCCCCCG GCCTGTCGTC GCTGTGGGTC GGGGTGGTGA CGCCGGTCCC CCGGGCGATC
GCGCGGCCGC TGGTCCGCTC GCTGCGGACG GAGGTGGTCG TCGGCGAGCA CGACATCGCG
CGGTGGGTGC CGGACCCGCC CGAGGGGCTG CTGCCGTTCG AGACCGCGGT CGCCTACGCG
CTGGCGCGGG TGCGCGACCG GACAGTGGAG ACCCGGTGGT CGACTGCGGT CTGGCCGGGC
AGCCCGCTCG CCGCGCCGTT CCCCGAGGTC GCCGCATACC TCTCCGCCGA CCGGGCCGGC
CGCCGCGGGC ACCGCCCGGT TGCGGCCGGT GTCCGGACCG CGGACGCCAG CCGGACGGCG
GCGGACGGCG GCTCGCCGGA CAACGCCCCG TCCAACGGAA TCCTGCCCGG CGGCGCTCCG
CCGGGCGAAC CCGTCCCGAC CGATCCGGCC TGGGCCGGGA GCTCGCTCTA CAGCGACGAG
CGGTCGAAGG CGGTGGCCGC GCCCGCGGAC CGTCTGTGGC AGGTCATCGA GGGGATCGGC
GGCGAGAACG GCTGGTACAG CTGGCCGCTG GCGTGGTCGG CGCGCGGGTG GCTCGACACG
CTCGTCGGCG GGGTCGGGCA CCGCCGCGGC CGGCGCGACC CAGCCCACCT GCACACCGGG
GAGGCGATCG ACCTGTGGCG GGTCGAGGAG CTGGTCCCCG GACGGCTGTT GCGCCTGCGC
GCCGAGATGA GGCTGCCCGG CCACGCCTGG CTCGAGCTGC GCGTCGGGCC AGGCGGCCCC
GACGGCAGGA CGGTCTACCG GCAGCGGGCG CTGTTCCTGC CGCGCGGGCT GCCGGGCCAC
CTCTACTGGC GGGCCGTCAG CCCGTTCCAC GCCGTGGTCT TCGGCGGAAT GCTGCGCACC
ATCGTCCGCC GGGCCGAATC CGCCCCGGCC GCGGCCGCGC CGGCTGTGGC GGCCCCGACG
GAGTCGACTT CGATCGAGCC GACCACGCCG GCCGGCCGGT GA
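
The GC content field in this record is a simple ratio: the number of G and C bases divided by the total number of bases. A minimal sketch of that calculation follows. It assumes the sequence is pasted in the 10-base blocks shown above, so whitespace is stripped first. The name gc_percent is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a DNA sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above.
print(gc_percent("GTGCGCATAC"))
```

Passing the full gene sequence to the same function should give a value that can be compared with the reported GC content.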
 
Protein sequence
MRILVTGATG YIGGRLAPRL LGQGHHVRCM TRDPARLSDV GWARHPDIEV VRADALGPES 
LRAAMEDVDV AYYLIHSIDT GGDFSAVDRR AAAAFARAAR AAGVRRIIYL GGLSPATGIG
MSAHLASRQE VARILLDSGV PTVVLRAAII IGSGSASFEM LRHLTERLPV MLTPRWVRTR
IQPIAVRDVL HYLIGCLQLP AEINRSFDIG GPDVLTYADM MQRFAAVEGL RRRVIIRVPV
LSPGLSSLWV GVVTPVPRAI ARPLVRSLRT EVVVGEHDIA RWVPDPPEGL LPFETAVAYA
LARVRDRTVE TRWSTAVWPG SPLAAPFPEV AAYLSADRAG RRGHRPVAAG VRTADASRTA
ADGGSPDNAP SNGILPGGAP PGEPVPTDPA WAGSSLYSDE RSKAVAAPAD RLWQVIEGIG
GENGWYSWPL AWSARGWLDT LVGGVGHRRG RRDPAHLHTG EAIDLWRVEE LVPGRLLRLR
AEMRLPGHAW LELRVGPGGP DGRTVYRQRA LFLPRGLPGH LYWRAVSPFH AVVFGGMLRT
IVRRAESAPA AAAPAVAAPT ESTSIEPTTP AGR
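
The gene sequence starts with GTG, while the protein sequence starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiating codon is read as methionine. The sketch below illustrates this with a self-contained codon table. It assumes the standard NCBI codon ordering (bases in the order T, C, A, G), and the function name translate_cds is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS (or a fragment of one), stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCGCATAC TCGTCACCGG TGCGACCGGG"))
```

For the first 30 bases of the gene, this gives MRILVTGATG, the first ten residues of the protein sequence. Note that the start-codon rule only makes sense when the input begins at the true start of the CDS. For an internal fragment, the first codon should be translated normally.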