Gene Franean1_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2301 
Symbol 
ID5670700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2750963 
End bp2752060 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content72% 
IMG OID641241221 
ProductPectate lyase 
Protein accessionYP_001506642 
Protein GI158314134 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.452858 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.747236 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGCC GAACAAGCAC CAGACACACG TACCGGGGGG AATCGACCTC GAGCGGGGAG 
GCGGCACCGG CCGGCCCGGC ACGCAGAGCA CGTCCACGGG CCGGGTGGGG CGTCCCCGTC
CTGATGGCGA CGCTGGCCCT CGGGGCCGGC ACACTGTTCG CCGGGTGCCA GCCGTTCGAC
GGCACCCCGG TGGGTGGCGG GGGCGCTCCG GCACCGACGT CCTCGCCGAC CGCGGGCCCG
ACCGCTCCGG TCACGCAGAC ACCGACAGCC GGGCCGGGTA GCCCGTCGCC GGCGGCGCCG
CCGGTCACCG CGACGCCGAG CGCCGGAACG TCTGCTCCCG CGAGCCCGAC CACGGCACCG
ACCGCAGGGC CGACGACCTC CGCGAGCCCG GGCGCGCCCG CGTCGCAGGG GCCGCTGCCG
AGCTGGCCGC GCGCCACCGC GGACGTGGAC GTCAGCAGCA CCATCTCGGT GGCGGGCACG
TTCGACGGCG GCCTCAAGCG TTACTCCGGC GTCAGCGACA GCGGCCAGGA GGAGGGCCAG
GACCCGATCT TCGAGGTGGC CGACGGCGGA ACGGTCCAGA ACGTGATCAT CGGGTCACCG
GCTGCGGACG GCATCCACTG CAAGGGCACC TGCACCCTGC GCAACGTGTG GTGGGAGGAC
GTCGGCGAGG ACGCGGCGAC GTTCAAGGGC ACGTCCGCGG CACAGACCAT GACCATCGAC
GGCGGCGGGG CCCGCGGCGC CTCGGACAAG GTGTTCCAGC ACAACGGGCC CGGCACGATG
GTCATCCAGA ACTTCCAGGT GCAGGACTTC GGCAAGCTGT ACCGGTCGTG CGGGAACTGC
TCGAAGCAGT ACGCCCGGCA CGTCGTCGTC CGCAACGTCA CCGTGACGGC GCCGGGCAAG
ACGCTGGTCG GCGTCAACAC GAACTACGGG GACTCCGCCC AGCTGTCCGG CGTCACCATC
GTCGGCGACA GCAGCCGGAA GATCAGTATC TGCGACCGCT ACACCGGCAA CAGCAGCGGC
TCCGAGCCGA CGAAGACGGG CAGCGGGGCG GACGGGACGT ACTGCCGCTA CACGCCGTCC
GACATCACCT ACCGGTAG
 
Protein sequence
MARRTSTRHT YRGESTSSGE AAPAGPARRA RPRAGWGVPV LMATLALGAG TLFAGCQPFD 
GTPVGGGGAP APTSSPTAGP TAPVTQTPTA GPGSPSPAAP PVTATPSAGT SAPASPTTAP
TAGPTTSASP GAPASQGPLP SWPRATADVD VSSTISVAGT FDGGLKRYSG VSDSGQEEGQ
DPIFEVADGG TVQNVIIGSP AADGIHCKGT CTLRNVWWED VGEDAATFKG TSAAQTMTID
GGGARGASDK VFQHNGPGTM VIQNFQVQDF GKLYRSCGNC SKQYARHVVV RNVTVTAPGK
TLVGVNTNYG DSAQLSGVTI VGDSSRKISI CDRYTGNSSG SEPTKTGSGA DGTYCRYTPS
DITYR