Gene Franean1_1056 details


Gene Information

Locus tag: Franean1_1056
Symbol:
ID: 5669470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1239228
End bp: 1240496
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 68%
IMG OID: 641239985
Product: hypothetical protein
Protein accession: YP_001505418
Protein GI: 158312910
COG category: [S] Function unknown
COG ID: [COG2899] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
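
The coordinate and length fields above are consistent with each other, and the short sketch below shows the arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The record does not state either convention, but both match the reported numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1239228
END_BP = 1240496


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length_bp(START_BP, END_BP)
print(length_bp, protein_length_aa(length_bp))  # prints: 1269 422
```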


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.969674
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTGGGTT CCGTACGGAC CGCGGAGCTC GTCCGTAAAC GCCTGTCGGA GCACTGTGAG 
CCGGTCCGGG AGGTGTTCCA AGGAGTGCAC GCACTTCGAA TTTTCGGCTG GTCCCTCGCC
ATCACGGTGA TCGGGGTCGC GGGGGCTGGT GTGATCGGCG GCGGGGACGC CGCGGCGATC
GTCGCGATCC TCGCCGTCCT CGAGATCAGC CTCTCGTTCG ACAACGCGGT CATCAACGCG
ACGATCCTGC GCCGGATGAG CGAGTTCTGG CAGCGCATCT TCCTCAGCGT CGGCGTCATC
ATCGCCGTGT TCGGGATGCG GCTGCTGTTC CCGATCGTGA TCGTGGCGCT GACGGCGCAT
CTCAGCCCGG TGGACGTCTT CGACCTGGCG CTGAACCACG AGGAGGAGTA CGGGGCCCGG
CTGCACGACG CGCACCCCTC GATCGCCGCC TTCGGCGGGA TCTTCCTTTT TATGATCTTC
CTTGACTTCA TGTTCGACCC GGAGCGCGAG ATCCAGTGGA TCAAGCGCAT CGAGGAGCCG
TTCCGCCGGG CCGGCCAGCT CGATGTCGTG TCCGTCGTGC TGGGGCTCGT CGCGTTGCTG
GTCGTGGGGG AGGCCTTCTC CGGCGACCAC ACCCAGCAGG TGCTGACGGC GGGGGTCGCC
GGTCTCGCCA CCTATCTGGG TGTCCGCGGG CTGGGCGAGT TCTTCGAGGC CCGCGGAATC
GGCGCGGACG ACGACGAGGA CGACGAGGAC GAGAAGGCCG GGGCCAACGG CTCGGCACCC
GGTCGCACCA CGGGAACCTC GGACGTGGTC CTTGCGACCG GCCGGGCCGC CTTCTTCCTG
TTCCTCTACC TCGAGGTGAT CGACGCGTCG TTCTCGTTCG ACGGCGTCGT CGGGGCGTTC
GCCATCTCGC AGAACATCTT CATCATCGCG GCCGGCCTGG GTATCGGCGC CATGTACATC
AGGTCGACCA CGGTGTACCT CGTCCGGCGC GGGACACTGG GCGAGTACAT CTACCTGGAA
CACGGAGCGC ACTACGCGAT CGGCGCGCTC GCCGTCATCC TGGCGGTCTC GATCGAGACC
GAGGTGCACG AGATCGTCAC CGGGCTGATC GGTGTGGCGT TCATCGGGCT GGCCCTGCTG
TCCTCGATCC GCCACCGCTC GAAGGAGCGG CAGGGGAACC TCGACGGCGG GGACGCCGCG
GCCGCGGGTG ACCAGCCCGG GGACGCCGGC GATCCCGAGG ACGCGCCGGT CGTCGGCACC
CGGAGCTGA
 
Protein sequence
MVGSVRTAEL VRKRLSEHCE PVREVFQGVH ALRIFGWSLA ITVIGVAGAG VIGGGDAAAI 
VAILAVLEIS LSFDNAVINA TILRRMSEFW QRIFLSVGVI IAVFGMRLLF PIVIVALTAH
LSPVDVFDLA LNHEEEYGAR LHDAHPSIAA FGGIFLFMIF LDFMFDPERE IQWIKRIEEP
FRRAGQLDVV SVVLGLVALL VVGEAFSGDH TQQVLTAGVA GLATYLGVRG LGEFFEARGI
GADDDEDDED EKAGANGSAP GRTTGTSDVV LATGRAAFFL FLYLEVIDAS FSFDGVVGAF
AISQNIFIIA AGLGIGAMYI RSTTVYLVRR GTLGEYIYLE HGAHYAIGAL AVILAVSIET
EVHEIVTGLI GVAFIGLALL SSIRHRSKER QGNLDGGDAA AAGDQPGDAG DPEDAPVVGT
RS
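
The gene sequence starts with GTG, while the protein starts with M. Under translation table 11, GTG can act as an initiation codon and is then read as methionine rather than valine. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. The codon dictionary is a deliberately partial subset of the standard code, covering only the codons in this excerpt, and the start-codon set lists only three of the initiators that table 11 allows. All names are hypothetical.

```python
# Partial codon table: only the codons needed for the excerpt below.
CODON_SUBSET = {
    "GTG": "V", "GGT": "G", "TCC": "S", "GTA": "V", "CGG": "R",
    "ACC": "T", "GCG": "A", "GAG": "E", "CTC": "L", "AGC": "S",
    "TGA": "*",
}

# Subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_SUBSET[codon]  # KeyError if outside the subset
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 nt of the gene sequence above, spacing as in the record.
print(translate_cds("GTGGTGGGTT CCGTACGGAC CGCGGAGCTC"))  # prints: MVGSVRTAEL
```

The result matches the first ten residues of the protein sequence listed above. For a full translation, a complete table 11 implementation such as the one in an established bioinformatics library would be the more practical choice.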