Gene Franean1_5253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5253 
Symbol 
ID5673587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6314658 
End bp6315725 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content75% 
IMG OID641244108 
Producthypothetical protein 
Protein accessionYP_001509517 
Protein GI158317009 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.973786 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.278605 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGACC CGCGCCGCCG CCGGCGCCTG ATCCTCGGCG CGGGCGCGCT GCTCGCCGTG 
ATCCTCGGGG TGGGCTTCAT CGCTGCCGGA ACGGACGGCG CCGGCACGAA CTCGGCGGAC
ACGGCGGCCG TCCCGATGCC GGCGTCCGAG CCCGGTACAC CCGGCGAGAT GAGCACGTCG
GCGGACTCCA GCGCGTCGGC GTCGGCGTCG GCGCCGTCGT CGGCGCCCGG TGCGGCCCGG
GCGGACGGCA TCGCCGCGGG AGCGCCGTCG ACCGTCCCGG GCGGGCCCGG TGGCACCGGC
GGCACCGGCG GCGAGCCCGC CCGGCCCGCC GGCGCCCAGC CGCGGATCGT CCGCAACGGC
ACGGCCACGC TGTCCGTGCC GGCCGGTGCC GTGGACAAGG CGGTCCAGGA TCTCTCCGCG
GCGGCGCGGG GGCTTGCGGG GTACACCGAG TCCAGCGAGG TCAGCGGCAC TCCGTCGACC
ACTGATGACG GCAGCCAGTA CGCCACCGTG ACCCTGCGGG TGCCGAGCGA GTCGTTCGAC
GAGCTGCGGT CCGGCCTGAG CCGGATCGGC ACGGTGTCGG CGTCGACGAT GTCCTCGCGC
GACGTGACCG GGGAGTACGT CGATCTCGAG GCGCGCAAGC GCGCGCTGGA GGCCTCCCGC
ACCGCCTACA CGACGCTGCT CTCCAATGCC ACCACGGTGG GGGAGACGCT GTCGGTGCAG
CAGGCCATCG ACGGCGTGCA GATCCAGATC GAGCAGATCG AGGGCCAGCG GATGGTCCTC
GCCGACGCCA GCGACCTCGC GACGTTGACG GTGCAGATCG CCGAGGACGG AGCGGACCCC
GCACCCGGGC CGGACGATGA CGACTCGGGG CTGGTCGCTG CCGCGCGGAC ATCCTGGAAC
CGTTTTGTCC GCGGTATCGA GGAGATCATC GCGCTGCTCG GCCCGCTGGC GCTGGTCGGC
CTGGTCGCCG CGTGCGTCTA CGGGGCCGTC CGGATCGCGC GCCGGTGGGG CTGGATCCCG
ACGACCCCGG CCCCGCCCGC GCCGCCGCGG GACTCGGCGG GGTCGTAA
 
Protein sequence
MPDPRRRRRL ILGAGALLAV ILGVGFIAAG TDGAGTNSAD TAAVPMPASE PGTPGEMSTS 
ADSSASASAS APSSAPGAAR ADGIAAGAPS TVPGGPGGTG GTGGEPARPA GAQPRIVRNG
TATLSVPAGA VDKAVQDLSA AARGLAGYTE SSEVSGTPST TDDGSQYATV TLRVPSESFD
ELRSGLSRIG TVSASTMSSR DVTGEYVDLE ARKRALEASR TAYTTLLSNA TTVGETLSVQ
QAIDGVQIQI EQIEGQRMVL ADASDLATLT VQIAEDGADP APGPDDDDSG LVAAARTSWN
RFVRGIEEII ALLGPLALVG LVAACVYGAV RIARRWGWIP TTPAPPAPPR DSAGS