Gene Franean1_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0914 
Symbol 
ID5669328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1063956 
End bp1065380 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content74% 
IMG OID641239841 
Producthypothetical protein 
Protein accessionYP_001505276 
Protein GI158312768 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.447806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0678288 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGAGG CCGCCGGTTA CCTGCTCGTG GCCGTCCTGC TGATCGCCGG TAACGCGCTC 
TTCGTCGCCG CCGAATTCGC GCTCGTCGCC GTCGAGCCGC ACCAGGTCGA GGAGGCGGCG
AACACGGGCG ACCGCCGCGC CGGAATCGTC CTGCGGGCGG TGCGCTCGCT GTCGTTCCAG
CTCTCGGGCG CGCAGCTCGG CATCACGGTG ACCTCGCTGG TCGTGGGCTA CATCGCCGAG
CCCGCCGTCG CCACGCTGCT CGAACCGCTA CTGGACGTCG CCCACATCCC GCCGTCGCCG
CGCGATGTCA CCGCGATCGT GCTGGGGCTG GTAGTGGCCA CCGTGACGCA GATGGTCTTC
GGTGAGCTCG TCCCGAAGAA CTGGGCGATC TCCGAGCCGG TGCGGGTCGC CCGGGCGGTC
GCGCCGGCAC AGGTCATGTT CTCGCGGGTG TTCCGGCCGC TGATCACCCT CCTGAACGGT
TCGGCGAACG CCCTGCTGCG GGCGATGGGC GTGGAGCCCC AGGACGAGCT GCGCAGCGGC
CGGTCGTCGG ACGAGCTGAG CTCGATCGTG GCGTCCTCGG CCGAGCACGG CACCCTGCCC
GTCACCACCG CGGCCCTGCT GAGCAGGTCG CTGCGCTTCG GCGACCGGCG GGCGTCCGAC
GTGATGACCC CGCGGGTGCG GACGGTCTTC GCCGGGGCGG GCACCTCGCT GGCGGAACTG
CTGCGCCTCG CCGAGCACAC CGGGCACTCG CGCTTCCCCG TCCTGCGCGA GGACGACGAG
ATCGGCGAGG ACGGGTACGG CGTGGTCGGC GTCGTCCACG TCAAGGACGC GTTCGGAGTC
CCGGCGCCGG AGCGGGCGCG GCGGACGGTG CCGGAGATCA TGGTCGAGCC GCTGCTGGTG
CCCGCGTCGC TGCACTGCGA GGTCCTGCTG CGCCGGCTGC GGCGCGGCGG CCTCCAGCTC
GCCGTCGTCA TCGACGAGTA CGGCGGCACG GACGGCATCG TGACGATGGA GGACCTCGTC
GAGGAACTCG TCGGCGACGT CGACGACGAG CACGACCGCC CGGCACCGCC CGACGCGGTG
GCCCTCGGCG CCGGCCAGTG GATGTTGTCC GGGCTGCTCC GCCTCGACGA GGTAAGCGAA
GCGACCGGGG CCCGCCTGCC CGCCGGCCCC TACGAGACCA TCGGCGGGCT CGTCCTGGCC
CGCCTCGGCC GGCTGGGAAG GCCGCGTGAC GTCGTCCAGG TCGAGGGCCA TGAGCTCGTT
GTGGCCTCGG TCGACGGCCA CCGGATCGAC CGGGTACGCC TCAGCCCCAC GGAAGCCACG
GACGGGACAC CCGCCCGGGC GGACGCACCC GGGCACGCTG GGACGTCCGG GCACGCGGGC
AGGTCCGGGC ACGCGGGCAC GTCCGAGCGG GCGGGCGACC GATGA
 
Protein sequence
MLEAAGYLLV AVLLIAGNAL FVAAEFALVA VEPHQVEEAA NTGDRRAGIV LRAVRSLSFQ 
LSGAQLGITV TSLVVGYIAE PAVATLLEPL LDVAHIPPSP RDVTAIVLGL VVATVTQMVF
GELVPKNWAI SEPVRVARAV APAQVMFSRV FRPLITLLNG SANALLRAMG VEPQDELRSG
RSSDELSSIV ASSAEHGTLP VTTAALLSRS LRFGDRRASD VMTPRVRTVF AGAGTSLAEL
LRLAEHTGHS RFPVLREDDE IGEDGYGVVG VVHVKDAFGV PAPERARRTV PEIMVEPLLV
PASLHCEVLL RRLRRGGLQL AVVIDEYGGT DGIVTMEDLV EELVGDVDDE HDRPAPPDAV
ALGAGQWMLS GLLRLDEVSE ATGARLPAGP YETIGGLVLA RLGRLGRPRD VVQVEGHELV
VASVDGHRID RVRLSPTEAT DGTPARADAP GHAGTSGHAG RSGHAGTSER AGDR