Gene Franean1_2052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2052 
Symbol 
ID5670453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2471848 
End bp2473143 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content77% 
IMG OID641240974 
Producthypothetical protein 
Protein accessionYP_001506395 
Protein GI158313887 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.618886 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.624101 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGA AGGACGGCTC CCGCCCCACG GCGCTGCCGC CGATCGCAGT GCGCCCAGGC 
CTGTTCGTGT CCGTGGCGGT CGTGACAGTG CTGCTCGGCG CGCTCACCCT CCCCGCCACC
GTGCCGGGCC GTCCGGGCTT CGCCTACTTC AGCGGGGGCG TGCTCGGCGC GGGTCTGCTC
GTCGCGATCC TGCTCGGGGC CGACCTGGCC CGGGCCGCCG CCGCGCGCAG GGCCGGGATC
ACGGTCACCG GGATCACCCT CGGCGCCTTC GGGAGCCGGC TGGGCCTCGC ACCCGCACGC
GACCGCTCGA CGGACCGGGG TGACGGCAGC CCGCTGAGCG GTACCGGCCC GTCGGGCGGT
AGCGCCCCGT TGGGCGGTAC CGGCCCGGCC GGTTCCGATG ACCCGCTCGC CCCGGCCACC
GGTGACGCAC TCGCCGACGC CGCCGTCGCC CGTGCGGGCC TGATGGTGAC GGCCCTCGCC
GGGATGGTGC TGGTCGCCGC CGGCGCGTTC GCCCCCGGCG GAACGCTCGC CCTGGTCGGC
GAGCTGGCGC TCTGGGTCGG CACGTTCGCC CTGCTCATCA CCGTCGTCGA CCTGCTGCCC
GCACCGCGCA GCGCCGGCGG GCGGATCCTC GCCGCCCGGG TCCTGCGCCG CACCGGTGAC
GAGGCGGCCG CGGGCGCGGC CGTGGCCCGC GCGGGTGTCA TCACCGGGTG GACCCTGATC
GTCTTCGGTG CGGCGGCCAC CTTCCTGGTC GGGCTGGTCG GCCTGTGGGC GATTCTCCTC
GGCTGGCTCG CGCTCGGGAC GTCCCGGCTC GCGCAGACGC AGGAGCGCAC CTCCGCCGCG
CTGCGCGGGG TCTTCGTCCG TGACGTGATG ATCCCCGCCC CGGAGGCGCT GCCGTCCTGG
AAGACAGTCG CCGCCGCGCT GGACGAGACC GTGCTCCCGT CCCGCGCCTC GGTGTTCGGG
GTCCGGGACT TCGCCGGGCC GCTGATCGGC GTCACCCTGC TGCGTGATCT GGCCGCGGTG
CCCGCGGACG ACCGCGACCT GGCCCGGGTG GCCCGGGTGA CCATCCCGCT GGACCGGGTC
GCCACCGCCC GTCCGGAGGA GCCGCTCGCC GCGGTCGCGT CGCGGCTGGC GCACCGGCCC
GCCGCGGGCG TGATCGTCGT CGTCGCCGAC GGCCCGGATG GCCTGCCCGG GATGGTGGGC
ACCGTGGGCC CGGGCGAACT GGCCCGGGCG CTGGAGACCA CGCCGCTGCA CGGCCGGGTG
GTCATCCCGA CCGGCTTCGG CCGCCGCCGC CGGTGA
 
Protein sequence
MSTKDGSRPT ALPPIAVRPG LFVSVAVVTV LLGALTLPAT VPGRPGFAYF SGGVLGAGLL 
VAILLGADLA RAAAARRAGI TVTGITLGAF GSRLGLAPAR DRSTDRGDGS PLSGTGPSGG
SAPLGGTGPA GSDDPLAPAT GDALADAAVA RAGLMVTALA GMVLVAAGAF APGGTLALVG
ELALWVGTFA LLITVVDLLP APRSAGGRIL AARVLRRTGD EAAAGAAVAR AGVITGWTLI
VFGAAATFLV GLVGLWAILL GWLALGTSRL AQTQERTSAA LRGVFVRDVM IPAPEALPSW
KTVAAALDET VLPSRASVFG VRDFAGPLIG VTLLRDLAAV PADDRDLARV ARVTIPLDRV
ATARPEEPLA AVASRLAHRP AAGVIVVVAD GPDGLPGMVG TVGPGELARA LETTPLHGRV
VIPTGFGRRR R