Gene Franean1_2218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2218 
Symbol 
ID5670617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2653537 
End bp2654817 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content78% 
IMG OID641241138 
Productaminotransferase class V 
Protein accessionYP_001506559 
Protein GI158314051 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.245886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0443202 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCGC GCCGGAGGAC GATCCCCGCC CTCGATGCCG ACCGGGCCCG CCTGAGCCGC 
GGGCACGCCG TCGTTCCGGA ACCCGAGATC GGCCGTCGCT GGCGCGCGGC GCGGGCGCGG
GCGGGCGTCG TTCACCTGGA CGCCGCCGCG GCCGCCCGGC CGAGCATCGC GACCGTCACC
GACCAGACCG ACCACCTGCG CCAGGAGAAC CTGGTGGGCG CCTACATCGC CGAGATGGAG
GCCGCGGAGG TCCTCGCCGG CGCTCGGGCC CGGCTCGCGA CGCTGCTGGG CCCCGGGCTC
ACCGCCGACG ACGTGGTCTT CCAGCATTCC GCCAGCACCG GATTCGCCGC GCTCCTGGCC
GCCTGGCCAC TGCCCCCCGG TTCCCGCGTC GGCGTCGTGC CCGGGGAGTA CGGGTCGAAC
CTGCTGCTGC TGGCGACCCG CGCGGCCCGC GACGGCCTGG AGCTGGTCGA GCTCCCGGTG
GACCCGCTGG GCCGCATCGA CCTCGACCGC CTCGACCGCG TGCCCGGCCC GGCCCGCCTC
GAGGACCTCG CCCTGGTGAC GTTCCCGCAC GTCCCGAGCC AGCGCGGCGT CGCGCAGCCC
GCGGCGGAGG CGGCGGCCCG CTGCGCCGCC GCGGGCGTCG ATCTGATCCT CGACGTGGCC
CAGTCGCTCG GCCAGGTCGA CCTCGCCGGC ATCGGGGCGG CCGCCTACGT CGGTACCTCA
CGCAAGTGGC TGTGTGGCCC CCGCGGCGCG GGCTTCACCG CGGTGCGCCC GGACGTCGTC
GACCGCCTCG GCCCCGGGGC GCCCAGCCTC CACTCGGCCC ACCCCCACGA CCTGCGGGCC
CATCCCGCGC TGCCCGCGCG ACCGCTGCCG GGCCCATCCC GGATGGCGGT CGGCGAGGCG
GCGGTGGCGA GCCGGGTCGG GCTGGCCACC GCGCTGACCG AGCTGCTGGC CGAGGACCTC
GGCGCGATGC GCGACCGGAT CATCGCCCTC GCCCGGCACG CGCGCCGCAC CCTCGACGGA
GTCGCCGGCT GGCGCCTCGG CGAGGAGGCC GACTCCCCCA CCGGCATCGT GACCCTGCGC
CCACCGGCCG GTGTCGATCC GCTGGCGGTC TGCCGGGCGC TGTACGTCGA GGCCCGGATC
CTGACCAGCC CGGTCCCGGC CGGCCGGGCA CCCGAGCTGA CCGCACCGGT GCTGCGGGCG
AGCACGCACG TCTACAGCAG CCCCGCGGAG ATCGAGCAGC TCGCCGAGGC CCTCGACAGG
TGGGGCCGCC CGGGGCCGTG A
 
Protein sequence
MEPRRRTIPA LDADRARLSR GHAVVPEPEI GRRWRAARAR AGVVHLDAAA AARPSIATVT 
DQTDHLRQEN LVGAYIAEME AAEVLAGARA RLATLLGPGL TADDVVFQHS ASTGFAALLA
AWPLPPGSRV GVVPGEYGSN LLLLATRAAR DGLELVELPV DPLGRIDLDR LDRVPGPARL
EDLALVTFPH VPSQRGVAQP AAEAAARCAA AGVDLILDVA QSLGQVDLAG IGAAAYVGTS
RKWLCGPRGA GFTAVRPDVV DRLGPGAPSL HSAHPHDLRA HPALPARPLP GPSRMAVGEA
AVASRVGLAT ALTELLAEDL GAMRDRIIAL ARHARRTLDG VAGWRLGEEA DSPTGIVTLR
PPAGVDPLAV CRALYVEARI LTSPVPAGRA PELTAPVLRA STHVYSSPAE IEQLAEALDR
WGRPGP