Gene Franean1_2478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2478 
Symbol 
ID5670874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2954378 
End bp2956135 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content72% 
IMG OID641241395 
Producthypothetical protein 
Protein accessionYP_001506816 
Protein GI158314308 
COG category[R] General function prediction only 
COG ID[COG1568] Predicted methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.627901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCA ACATGACCGG CGAAAAATCC TCGACGGACG AGTCCTTGGC CGATAAGGCC 
CCGATGGACA ACGCCGCCGC CTATGTCGCC GGCTACGGTG TGCGATCCCG CCCACTGCGC
GAAATTCTCT CGCTGTTGAC CGGCGGCAGC CAGCCGATCG ACGTTCTGAT AACGCGGACC
GCGACACCGC GGCGGGCGGT CGAGGAGCTT CTGCGTAGCC TTGGCCCGGA TCTCACCGAA
AACAAGCACG GTTACCGCCT ACGGGCGGAA GTCATCGCCG AATATCGCGC CCGATTCGCC
CTGGACGGCC TGGAGCCGGC CGGCGCCGCG GGCGATGACC GCGACGGCGA CGGGCTCGGC
GCCCTTGCCC TGCTGATCAA GAACGCCCCC GCACCCCGCC GGGACCTCGA CCATGTGGCC
GCGACAGCGG GAACGCTCGC CCGCCGCGCG ACCTGGCTCG ACACCACCTA CGACCTGGCC
GGCCGGCGGG TGCTGTTCGT CGGCGACCAC GACCTGACCT CGGTGGCGCT GGCCCGGCGG
CAGCCCGGCG CCGAGATCAC CGTGGTCGAC CTGGATGAGC GGACCCTCGC CTACATCGAC
GCCACGGCAC GCTCCGAGGG ACTGTCGATC CGCACGCTGT TCGGCGACCT GCGATTCACC
CTGCCGCCCG CCGCGCGGGA GTGGGCGGAT CTCGTCCTCA CCGATCCGCC GTACACCCCC
GAAGGGGTCG GTCTTTTCCT CGGGAGAGCG CTCGCCGGCC TGGGTGACCG GAAAAACGGA
GTCGTCGTCG TCGCCTACGG CCACAGCCGG CTCCATCCGA TGCTGGGTTT TCAGGTACAG
CAGTCCATGC AGCAGTTCGG TGTCGTCTTC GAAGCCATAC TGCCGGCATT CAACCGTTAT
GACGGCGCGC AGGCCGTGGG AAGTGCCAGC GATCTGTACG TGTGCGCGCC GACCAGCCGC
ACGTGGAAAG TCCTCGAGCG GGCGGTGGAG AGCTTCGGAA CGCGCATCTA CACCCACGGG
ACGCAGTCCG TGGAGAGCAC CGCGGCGGTC GAGCTCGGGC CGGCCGCGAC GGTGATCGGC
GACGCCACCG CCGCCGGATC GCCGGGCGCC CGCCGCCTCG GCCTGCGCTC GCTGTTCACC
GGCTCGGACT ACCTTCGCGA CCTCGGCGAC AACGCGGACG TCGCCGTGGA CCTCACCGCC
GACCCCGGTC CGCTGCTGCT GCGAGCCCTG CTCGCCGTGA CCGCCCGGAA GGTCCGGTTC
GTCGTGCCGG TCGACCATCC CGACGTGTCG ACCCCCGGCG CCCGGGCGGC TCTCGCGAGC
CTGGTCGCCC CGAAGTACCG CCTGACGTTC CCCCCACCGG CCTCAGCGGG ACGTCCCCGA
GGCGAGGCCG GGCACGAGGG CGGCGGATAC GGGGTCGTCC ACGCGGACCT GGTCGACGCG
GAGGCGGACA CCGGCGGCGA GGCGGACGTC GACGTGGGCG CCGGCGGTCC GACCCGGCCG
CCCGCCCCGG ACGTAACCGC GCGTTGGCTG CTCGAGCGGG CCCACGGCCG GATCGGGAAC
GTGCTGCGCG AGGGCGTCAT CCGTGCGGCC GCCCGGGACG GCCGGGCGAT CTCCAAGAAC
GACGCCCGCG CGCTCGTCCG AGCTCAGGTG GGCACAGCCG ACCTCGACAC GCTGGACCTG
ACGGCGATCG AGACCCCCCG CGCACGCCTG GAGCGGGTGC TCCAGGCCGT CCGCGCGCCG
GGGCCGGCTC GATCTTGA
 
Protein sequence
MNVNMTGEKS STDESLADKA PMDNAAAYVA GYGVRSRPLR EILSLLTGGS QPIDVLITRT 
ATPRRAVEEL LRSLGPDLTE NKHGYRLRAE VIAEYRARFA LDGLEPAGAA GDDRDGDGLG
ALALLIKNAP APRRDLDHVA ATAGTLARRA TWLDTTYDLA GRRVLFVGDH DLTSVALARR
QPGAEITVVD LDERTLAYID ATARSEGLSI RTLFGDLRFT LPPAAREWAD LVLTDPPYTP
EGVGLFLGRA LAGLGDRKNG VVVVAYGHSR LHPMLGFQVQ QSMQQFGVVF EAILPAFNRY
DGAQAVGSAS DLYVCAPTSR TWKVLERAVE SFGTRIYTHG TQSVESTAAV ELGPAATVIG
DATAAGSPGA RRLGLRSLFT GSDYLRDLGD NADVAVDLTA DPGPLLLRAL LAVTARKVRF
VVPVDHPDVS TPGARAALAS LVAPKYRLTF PPPASAGRPR GEAGHEGGGY GVVHADLVDA
EADTGGEADV DVGAGGPTRP PAPDVTARWL LERAHGRIGN VLREGVIRAA ARDGRAISKN
DARALVRAQV GTADLDTLDL TAIETPRARL ERVLQAVRAP GPARS