Gene Franean1_5025 details

Gene Information

Locus tag: Franean1_5025
Symbol:
ID: 5673363
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6018806
End bp: 6020251
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 70%
IMG OID: 641243879
Product: hypothetical protein
Protein accession: YP_001509294
Protein GI: 158316786
COG category:
COG ID:
TIGRFAM ID:
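
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6018806
end_bp = 6020251

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.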


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.898737
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.21169
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTCCCTC TAGCTTCCTG GGATCCTGAC CAGCTGAGTC TCTACGGTTG GCTGGAAAAC 
CAGATCCTGC GCGAGTACCC ACACCTGTCC GCCGTAGATG CGGCCACCGG GCACAGTCGC
GCCGCGGCGC TGCTGCACGC TGGGCTGATC CTTCCTGTCC TCGACGGGCT CGATGAGATC
CGCGTGGGCG GCAGGGACCA GGCCCTGACT GCTATCAACG ACGGCCTGCG CAGCAACATC
GGACTCGTTC TGAGCTGCCG CGCCGAGGAG TTTCGTGCGG CCGTGCAATC CGAGCCCGAC
TGGCAGCCAA TCCACCTCGA CGGCGCCGCT GGCATCCGCC TAACGCCGCT CACTTCGGCA
GTGGTCGGGG ACTACCTTCT AGCCGGATCT GGCAACAACG GTTTCACCAG ATGGGAACAG
GCCCTCACCG CGCTGGCTGA CCCCACCACG TCCCTCGGCC AGGCGCTTTC CACCCCACTA
GCCGCCAGCC TCGCCCGCAC CATCTACAAC CCCCGCCCCG GCGAGTTCAT CCGCGGATTG
CCCGACCCCT CCGACCTGAC CACGTTGCCG ACCCGGCAGG CCGTCGAACA GCATCTCTTC
GACGGCTACC TGCCTGCCGC CTACCGGGCC CACCCCGACC AGCCCACTCG CTGGACCGCC
AGCCAGGCCA CCCGCTATCT CGTCTTCCTC GCCCATCACC TCGAACACCG CCTGGAGACC
ACCGCCCTCT CCTGGTGGGA GCTGTCCCGG GCCACCCCAC GAGCACTTTC GATCCTCGTG
TACGGGCTCA CCTCCGGGCT CGCATTCGGG CTCGCGGACA GGCCCGCGGT CGGACTCGCG
CTCGGACTCG TGTTCGGGCT CGCGGTCGGG CCCGCGGTCG AACTCCTGAT TGGCAGCGCC
TCCCCCGGCC GCATGGCTCT GCGACCGCCG CGGCTGTTCG ACCTGGCGGT AGGGCTCACA
ATCGGGGTCG CGGTCGGGCT CACGGACGGG CTCATGGACG GGCTGTCGGC CGGGCTGTCG
ACCGGACTCC CGTTCGGGCT CGCGCTCGGG CTCGTAGTCG GGATCAGACT TGACCCCACG
GCGGAGACGA GACGGGCGAC AGATCCCAGA ACCATCCTGG TCCAGGACCG GGCCAGCGGG
CTCGCAATGG GGCTCGTTCT CGGGCTCACG GTCGGGCTCA CCGTGGGGTT CACGGACAAT
GCGCTCACGG CCGGGCTTCC GGCCGGGCTT CCGGCCGGGC TTACAGCTAC CCTCACATTC
GCGCTCGGCT CCGCGTGGGG ACGGCTGGGG GTGACCCGCC TGTGGCTTGC CGCGCGGCGG
AAGCAGCCGC TGCGCCTCAT CGCCTTCCTC ACCGACGCCC ACGATCGCGG CGTCCTGCGC
CAGGCCGGGG CCGTGTGGGA GTTCCGGCAC GCCAACCTCC AGCGCCACCT TGCGGGTCCG
CCGTGA
 
Protein sequence
MVPLASWDPD QLSLYGWLEN QILREYPHLS AVDAATGHSR AAALLHAGLI LPVLDGLDEI 
RVGGRDQALT AINDGLRSNI GLVLSCRAEE FRAAVQSEPD WQPIHLDGAA GIRLTPLTSA
VVGDYLLAGS GNNGFTRWEQ ALTALADPTT SLGQALSTPL AASLARTIYN PRPGEFIRGL
PDPSDLTTLP TRQAVEQHLF DGYLPAAYRA HPDQPTRWTA SQATRYLVFL AHHLEHRLET
TALSWWELSR ATPRALSILV YGLTSGLAFG LADRPAVGLA LGLVFGLAVG PAVELLIGSA
SPGRMALRPP RLFDLAVGLT IGVAVGLTDG LMDGLSAGLS TGLPFGLALG LVVGIRLDPT
AETRRATDPR TILVQDRASG LAMGLVLGLT VGLTVGFTDN ALTAGLPAGL PAGLTATLTF
ALGSAWGRLG VTRLWLAARR KQPLRLIAFL TDAHDRGVLR QAGAVWEFRH ANLQRHLAGP
P