Gene Franean1_1642 details

Gene Information

Locus tag: Franean1_1642
Symbol:
ID: 5670044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1957924
End bp: 1959321
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 70%
IMG OID: 641240560
Product: hypothetical protein
Protein accession: YP_001505986
Protein GI: 158313478
COG category:
COG ID:
TIGRFAM ID:
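
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (both assumptions are consistent with the values in this record). The function names are hypothetical and used for illustration only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(1957924, 1959321)
print(gene_len)                  # 1398
print(protein_length(gene_len))  # 465
```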


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.121155
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCAC AGGAGGGCCA GCAGGCGCTG GAGGCACAGC AGCCGCTGGA CGCGCTGGAG 
GCGCAGTTCG CGCAGTGGCG TCACTACGCG CAGCGCCGCC GGGAGCTGCG GACCGCCGAC
GCCGACGAGC TCGAGGACCA TCTCCGCGGT TCCGTCGACG AGCTCGTCAT GGCCGGCCTG
AGCGCCGACG AGGCGTTCCT GGTCGCGGTC AAACGGATGG GCAGCCTCGA CGAGCTGTCC
CGCGAGTTCG CCCGGGAGCA TTCGGAACGG CTGTGGAAAC AGCTGGTCCT GACCGGCGGC
CCGGCCGCGG ACACCCGCTC GCGGCGCGAC CTGCGGATGC TGGTGCTCTG CGCCGCGGGC
GCGGCGGTGT CTGTCAAGGC GCCGGAGCTG TTCGGCGTGC GGATGACCGA CGACGGTTCG
GCCTCGTTCT ACGGGCCGAA CCTCAGCCTG TTCGTCCTGC CGTGGCTGGC CGGCTTCCTC
GCCTGGCGCC GCCAGGCCCC GCGCCCGCTG GTCGGGATCC TGGCGGCGCT GTTCGCGCTC
GGCGCGGTGG CGGCCAACGT CTACCCGCTC GGCGACGATT CGCAGTCGGT GGTCGTCACC
AGCATCCACC TGCCGATCGC CCTGTGGCTC GTGGTGGGCC TGGCCTACGC CGCGGACGAC
TGGCGGTCGT CGCGCAGACG CATGGACTTC ATCCGCTTCA CCGGCGAATG GTTCGTCTAC
TTCGTCCTCA TCGCCCTCGG CGGCGGTGTG CTCACCGTGT TCACGGCCGG CACCTTCGAA
GCCATCGGAA TCGTTTCGGA CGACTTCATC TCGCAGTGGC TCCTTCCCTG TGGAGCGGCA
GCCGGGGTCA TCGTGGCCGG GTGGCTCGTC GAAGCGAAGC AGAGCGTGGT GGAGAACATC
GCCCCGGTGC TCACCAGGCT GTTCACCCCG TTGTTCACCG TGGTCCTGCT GGCCTTCCTC
ATCGCCGTCT GCTGCACCGG CACCGGCATC GACGTCGAGC GGGACGCGCT GATCCTGTTC
GACCTGCTGC TGGTCGTCGT CCTGGGGCTG CTGCTCTACT CGATGTCAGC CCGCGATCCG
CTGGCCCCGC CCGACCTGTT CGACCGGCTG CAGCTCGCCC TGGTGGTGAG CGCGCTGGCC
ATCGACGTGC TGGTCCTGCT GGCGGTCACC GGGCGCATCA CCGAGTACGG CACCACGCCC
AACAAGGCCG CGGCGCTCGG GGAGAACGCC ATCCTGCTGG CGAACCTCGC CTGGTCGGCG
TGGCTCCTGC TGAAGCTGGT CCGCCGGCAC ACGCCATTCG CGGCGCTCGA ACGCTGGCAG
ACCTCTTACC TGCCGGTGTA CGCCGTCTGG GCCTGGATCG TGGTCCTCGC CTTTCCTCCG
TTGTTCGGCT ATGCCTGA
 
Protein sequence
MTAQEGQQAL EAQQPLDALE AQFAQWRHYA QRRRELRTAD ADELEDHLRG SVDELVMAGL 
SADEAFLVAV KRMGSLDELS REFAREHSER LWKQLVLTGG PAADTRSRRD LRMLVLCAAG
AAVSVKAPEL FGVRMTDDGS ASFYGPNLSL FVLPWLAGFL AWRRQAPRPL VGILAALFAL
GAVAANVYPL GDDSQSVVVT SIHLPIALWL VVGLAYAADD WRSSRRRMDF IRFTGEWFVY
FVLIALGGGV LTVFTAGTFE AIGIVSDDFI SQWLLPCGAA AGVIVAGWLV EAKQSVVENI
APVLTRLFTP LFTVVLLAFL IAVCCTGTGI DVERDALILF DLLLVVVLGL LLYSMSARDP
LAPPDLFDRL QLALVVSALA IDVLVLLAVT GRITEYGTTP NKAAALGENA ILLANLAWSA
WLLLKLVRRH TPFAALERWQ TSYLPVYAVW AWIVVLAFPP LFGYA
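
The gene and protein sequences above can be cross-checked against each other. The sketch below is one possible way to do that in plain Python, not the method used to generate this record. It builds the codon assignments of translation table 11, strips the 10-base block spacing, and translates up to the first stop codon. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon assignments for translation table 11, in the conventional
# T, C, A, G ordering of the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence from this record.
print(translate("ATGACGGCAC AGGAGGGCCA GCAGGCGCTG"))  # MTAQEGQQAL
print(translate("TTGTTCGGCT ATGCCTGA"))               # LFGYA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field listed in the record.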