Gene Franean1_5242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5242 
Symbol 
ID5673576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6303469 
End bp6304755 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content70% 
IMG OID641244096 
Productphosphoesterase PA-phosphatase related 
Protein accessionYP_001509506 
Protein GI158316998 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTT GGGCGCGCGC CCGCCGTGGA GTCGTGTGTA CGGCCGTCGT CGGTGTCCTC 
GCCGCGGCGT TACCGGTCGC GGCCGGGCTC CCGGCCGCGC GGGCGGCCGG TGAGAACACA
TCCACCAATG TCGTCATCAT CTGGGACCGC AATGCGCAGA CCGCGATCTG GGACGTCGCC
GGCCAGCAGC CGCAGGTCCA GGCGCGCAGC TTCGCGATGG TGCACGGGGC CGTTTACGAC
GCGGTGAACG CCATCGCCGG GCGGCCTTAC CAGCCTTATC TGCTCGCCCC GCGGGCCAGC
GGGCGCGAGT CGACGGACGC CGCCGTCGCG ACCGCCGCCT TCCAGGTACT CAGCTCCCTG
TTCCCGGCCC AGCGGCCGCG GCTGCAGACG CAGTACGACG AGTGGATGGC GAACCTTCCC
GACGACGCGG CGAAGCGGAG CGGGACCGCC GTGGGTGGCC AGACCGCCGC AGCGATGATC
AGTGCTCGGC AGAACGACGG GGCCTTCGGT AATCCGACCT GGCCGGTGGG CACCCAGCCC
GGCCAGTGGC GGCCGACTCC GCCGACCTTC GCCTCCGACA CGGCCTGGGT GGCGAACCTC
AGGCCGTTCC TGATCCCGAG CGCGTCGATG TTCCGCTCGG CCGGGCCGCC GGCGCTGACC
TCCGAGCGCT ACGCCCGGGA CCTCAACGAG GTCAAAACGA TCGGCGCCGT CAACAGCACG
ACCAGGACGC TCGACCAGAC CCAGGCGGCG ATCTGGTGGC ACGACCGGCA CCTGGGTGAA
TGGGAGATCA AGCGCCAGCT CGCCACGGGC CGCCGTCTGA GCACCCTGCA GACGGCCCGC
ATGTTCGCGA TGGTCGACCT CACCGAGGCC GACGCGACGA CCGCCTGCTT CAACGAGAAG
GCGGCCTGGA CGTCCTGGCG GCCAGTCACC GCGATCCAGC TGGCCGACAC CGACGGCAAC
CCGGCAACCA CCGCCGACCC GACCTGGGCA CCGCTGCTCG TCACCCCGCC ACACCCCGAC
TTCACGTCCG GGCACACCTG CTTCACGACG GCGAGCATGT CGACGCTGGC GTTCTTCTTC
GGCCGGGACG ACATCCCGTT CAGTGCGTAC AGTGCCGATT CGGGTACCAC ACGCTATTTC
CGTGGTTTCT CCCATGCCAT CGCCGAGGTG ATCGAGGCTC GCGTCTGGGG TGGCATCCAC
ACTCGGTCGG CCGACACCGA GGGCGCGAAG ATCGGCGCCA AGGTGACCGC CTACGCGACC
AGGAACTATT TCCGCCCGCG GCGTTGA
 
Protein sequence
MARWARARRG VVCTAVVGVL AAALPVAAGL PAARAAGENT STNVVIIWDR NAQTAIWDVA 
GQQPQVQARS FAMVHGAVYD AVNAIAGRPY QPYLLAPRAS GRESTDAAVA TAAFQVLSSL
FPAQRPRLQT QYDEWMANLP DDAAKRSGTA VGGQTAAAMI SARQNDGAFG NPTWPVGTQP
GQWRPTPPTF ASDTAWVANL RPFLIPSASM FRSAGPPALT SERYARDLNE VKTIGAVNST
TRTLDQTQAA IWWHDRHLGE WEIKRQLATG RRLSTLQTAR MFAMVDLTEA DATTACFNEK
AAWTSWRPVT AIQLADTDGN PATTADPTWA PLLVTPPHPD FTSGHTCFTT ASMSTLAFFF
GRDDIPFSAY SADSGTTRYF RGFSHAIAEV IEARVWGGIH TRSADTEGAK IGAKVTAYAT
RNYFRPRR