Gene Franean1_0219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0219 
Symbol 
ID5668644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp266560 
End bp268047 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content78% 
IMG OID641239148 
ProductSCP-like extracellular 
Protein accessionYP_001504592 
Protein GI158312084 
COG category[S] Function unknown 
COG ID[COG2340] Uncharacterized protein with SCP/PR1 domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.20701 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCAG AGCGTCCTGA ACGTGGCCGG GCTGCACCGT CCGGTCCGGG CTTGCGTGTA 
CGGCCGGGCT CGAATGCACG ACCGAGCGCG CATGCGCGGC CGGCCCTGCG TGCGCGGAGT
TCGCGCGCCG CGGACCCGCC CGGCCCCGCG GGCCCGGTCC AGGACGTGGT CCGTGGCCGG
CGGCACCGAA GACGTCCCAC GCCCAGGCGG GCGGGCGGGC TGCGCGCCGC CGCGGGGCTG
GTGGCGCTGA CGCTGGTCAC GCTCGGGCTC GTGGCGGCCT CGGGCGAGGT CCCCTCGACG
TCGAGCGCGG CGGGCCGTGG CCTCTCCGGG TCCAACAGGG TTGTCGCGGA CGGGTCCGTC
GCGGGCGGGA CCGGTGCGGG CACCGCGGAT CACACCGCGG GCACGGATGC CGTCGGCGCA
CCGCCTGGCA GTGCGTCGCG GAGCGGGCGG GGCGCCCTGG CGGGGTACGG CGGGTCGTCC
GCCGATCCGG TGTGGCCGGC CGACCGGGGG CCGGTGAGCG GTGTGCTGGT TCTGCGGGCC
GATCCCGCGC TGCTCGCGAT CATGGCCGCC GACGGGCAAC CGGGCTCGGC ACCGGGCGAG
CCCTGGGTCC TCTACCTGCT CTCGGGCCCC GTGAACGTGA TCCGGTGGGC GATGGGCGAG
CCGTTCGAGG CAAGCATCGA CACCCGGCAG CTGCCGAACG GCGACTACAC CCTGTCCGAG
GTGATCTTCC GTGCGACGCA CGCCCCGCTG GTCCGCACCG GCCGGGTGCC CGTCGCGAAC
CCCCTCCCGC CCGGGGCGGA CCAGACGGCT CCGGGCGGAG CCGCGCCGCG GTCAGCGGCG
GCCGCCGCCG CCGATCCGGG CGACGGCACG GCCCCGGGTA CGCCGGCCGG CCCTGCCGGG
TCGGCCTCCG CGGGAGTGTC CGGCTCCCCG ACGGGCGCCG GTTTGCCGTC GGGCGCCGAG
CCGGCGGCGG GCGCGAGCAC CGCGGCGGGC GCGAGCACCG CGGCGGGCGC GGTGGCCGGG
CCGGCACCGG CCCCCGTCGC TTCCGGCGCC CGGCTCGCCG GGACCCCGAC CGGCGCGGGC
GGCGGTGCCG GGTCGGCGGC GACGGCCGCG CTGATCGAGG AGGTCGTCAC GCGGACCAAT
GCCCAGCGCT CGGCCGCGGG CTGCCCGGCC CTCACGGTCG ACGCCCGCCT GGCGGCTTCG
GCCCAGGAGC ACAGCGCGGA CATGGCGGCC CGGAACTACT TCGACCACAA CGGCCGGGAC
GGGCGGTCGC CCTTCGACCG GATCGCGGCG GCGGGCTACG TCTTCTCGAT CGCGGCGGAG
AACATCGCGG CCGGCCAGCG GACCCCGGCC GACGTCGTCG CGGACTGGAT GGCGAGCCCC
GGCCACCGGG CGAACATCCT GAACTGTTCG CTCAGCCAGA TCGGTGTCGG GCTCGCCACC
GGAGGTGACT ACGGCACCTA CTGGGTCCAG GATTTCGGCT CGCCGTAA
 
Protein sequence
MHSERPERGR AAPSGPGLRV RPGSNARPSA HARPALRARS SRAADPPGPA GPVQDVVRGR 
RHRRRPTPRR AGGLRAAAGL VALTLVTLGL VAASGEVPST SSAAGRGLSG SNRVVADGSV
AGGTGAGTAD HTAGTDAVGA PPGSASRSGR GALAGYGGSS ADPVWPADRG PVSGVLVLRA
DPALLAIMAA DGQPGSAPGE PWVLYLLSGP VNVIRWAMGE PFEASIDTRQ LPNGDYTLSE
VIFRATHAPL VRTGRVPVAN PLPPGADQTA PGGAAPRSAA AAAADPGDGT APGTPAGPAG
SASAGVSGSP TGAGLPSGAE PAAGASTAAG ASTAAGAVAG PAPAPVASGA RLAGTPTGAG
GGAGSAATAA LIEEVVTRTN AQRSAAGCPA LTVDARLAAS AQEHSADMAA RNYFDHNGRD
GRSPFDRIAA AGYVFSIAAE NIAAGQRTPA DVVADWMASP GHRANILNCS LSQIGVGLAT
GGDYGTYWVQ DFGSP