Gene Franean1_0010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0010 
Symbol 
ID5668437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp15821 
End bp17065 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content75% 
IMG OID641238939 
Producthypothetical protein 
Protein accessionYP_001504385 
Protein GI158311877 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.225832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGCCAG GTGAGTCGGC GACGGGTGCT TCCCTCCTCT CCCCCGAGGC GCTGCGCGCC 
ACCGCGGACG CGATAGCCGC GGCCCAGGAG CCCGACGGGG CGATCCCGTG GTTCGCGGGC
GGCCACACCG ACCCGTGGGA CCACCTGGAA TGCGCGATGG CGCTGCTGGT CACCGGCCGG
GTCGACGCAG CCGACGCCGC GTGGGACTGG CTGCACCGCC GCCAGCGCCC GGACGGCTCC
TGGGCCACCA GCCACGTCGG CGGAGCAGTC AAGGAGGACT TCGCCGACAG CAACCAGTGC
GCCTACGTGG CGACGGCCCT GTGGCACCGC TGGCTCGTCA CCGGCGACCG CGCTTTCGTG
ACCCGGATGT GGCCGGTCGC CCGGGCTGCC CTGGACTTCG TCGTGGACAT GCAGGCCCCC
GGCGGCCAGA TCTGGTGGGC CCGCACCCCG ACCGGCGAGG ACTACCCCGA GGCACTCGTC
ACCGGATGCT CGTCCACCCT GCACAGCCTG CGGTGCGGCC TCGGCCTCGC CGCGCTGGTC
GGCGAGGCGC GGCCGGAATG GGAGGTCGCC GCCGGCGCGC TGTGGCACGC GCTGCGCCGC
CATCCCGAGT ACTTCATGCC GCGGGATCGC TGGTCGATGG ACTGGTACTA CCCGGTGATC
GGCGGCGCGC TGCGCGGCGC GGAGGGGCGG GCCCGGCTGC GCTCCCGGTG GGACGAGTTC
GTCGTGCCCG GTCTCGGTAT CCGCTGCGTG GACGACGAAC CATGGGTCAC CGGCGCCGAG
ACCTGCGAAC TCGCCATCGC GCTGCACCTG GTGGGGGAGA CCGAGGCGGC GGCCGGCCTC
GTCCGGGAGA TGCAGCACCT GCGGGCCCCC AACGGGGCCT ACTGGACCGG CTGGCAGTTC
GCCGACGGAT GCCACTGGCC GGAGGAGCAG TCAACCTGGA CGGCGGCCGC CGTCGTACTG
GCCGTCGACG CCCTGGCCGG CGGCCCCACC GAACGCACCT TCCGCGGGGA CGACCTGCCC
GAGGGCCTGC ACGTCGTGCA CCGCGACGAC CTGCCCGAGC CACGCGACCC GTCGACGCCC
CGCACCGGAC GAGGCACGGC CGGCGAACGG GCGCCGTGCG GCGAGCGGCG GGCCGGCGGC
CTCGCCACCC CCGGTCGGGG CGGCGAGGCG CTCACCGGTC CGTGGCGCGC GTGCGGGTGT
GACTCAGAAG AGCCGGCGGA GGACCCGCAG CGATCCCGTG CGTGA
 
Protein sequence
MRPGESATGA SLLSPEALRA TADAIAAAQE PDGAIPWFAG GHTDPWDHLE CAMALLVTGR 
VDAADAAWDW LHRRQRPDGS WATSHVGGAV KEDFADSNQC AYVATALWHR WLVTGDRAFV
TRMWPVARAA LDFVVDMQAP GGQIWWARTP TGEDYPEALV TGCSSTLHSL RCGLGLAALV
GEARPEWEVA AGALWHALRR HPEYFMPRDR WSMDWYYPVI GGALRGAEGR ARLRSRWDEF
VVPGLGIRCV DDEPWVTGAE TCELAIALHL VGETEAAAGL VREMQHLRAP NGAYWTGWQF
ADGCHWPEEQ STWTAAAVVL AVDALAGGPT ERTFRGDDLP EGLHVVHRDD LPEPRDPSTP
RTGRGTAGER APCGERRAGG LATPGRGGEA LTGPWRACGC DSEEPAEDPQ RSRA