Gene Franean1_3447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3447 
Symbol 
ID5671818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4078128 
End bp4079762 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content67% 
IMG OID641242335 
Producthypothetical protein 
Protein accessionYP_001507755 
Protein GI158315247 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157505 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTGGA TCCGTGACGG TAAGCCGCCG ACCAGCTGCG GCGACTGCCT GGCCTGGGGC 
GTCTTCGGGG GCCGGTACTG CCCGGCCTGC GCCAACTTCC GACAGCGGTT TCCCCGCGGC
ACGTGCGCCG GCTGCGCCCG CCTTCTCCCT GTGAAACGGG GCTACTGCCG GCTGTGCTGG
AAACAGGCCA CACTGGAAAC CGCCGGGCGT TGGGCACCCG CCATCCAGCC ATTCCTGGAA
GCCGTCCGCC ACCACCAACT GTTTCTCGCC AGCCTCCACC ACAGCGTGCC AGTTCCGGCC
GGCCGGCGCC TCGGCAAGAA CAGTCGAAAC GCGCCGGGCC TCACCTCACC CGCCGAAAAT
CCCGGCCGCG CCGTGGCATG GGTGCAACTT CGGCTAATCG ACCTGCCGCG GGACTTCACC
CGGTTCGACC GCCACAACGC CGACCTCACC AACCCACTGC TGGTCGACGC CCGCCGACGC
GCCCGCGCCA TGGGCGAAGC CCGAGGATGG ACCCGCCGCG TCCACACCGA TGTCGACCGG
GCGCTGGTCA TACTCCTGTC CGGGCTCGCT CCCGGCGAGA AAGTCCGCTA CTCCGACATG
TTTCCCGCTC TCCAAGCACG TTGGATCAGC GTGGAGCGGA CCGTTCAGGC GTTGGACCAT
CTCGGGCTGC TCGACGATGA CCGGCAGTCC ACCTTCGACG CCTACCTCGA GCACAAACTT
GACGGCATCA CCCCGGGCAT CCGTCGCGAC GTCGAGGACT GGATCCGAAC CCTGTACGCC
GGCGGCCCGC GAACCCGCGC GCACAGCAAA AACACCGCCT ACGGCTATCT CAACGAGATC
AAACCCACCC TGCTGGACTG GTCGACCCGG TTCCATCACC TCCGCGAGAT CACCGGCGAC
GACATCCAGA AGGTCATCAG CTCCGTCCAC GGCAACAAAC GCGACCACAC CATTGTCGTG
CTGCGATCCC TGTTCGATCA CTGCAAGAAA ACCGGCACCA TCTTCCGTAA TCCCGTCGCG
CGGCTACGCG CCGGCCGCAA ACACTACAAC CTCATCCTCC CGCTCCACCC CGAGCGGGTC
AGCATGGTTC TTGATGCCGC GACCAGTCCC GCAGCCCGGC TCGTCGTCGT CCTCGCCGGA
ATCCACGCCG CCCGCAACAA GACAACCCGC CACGTGCAGC TCGACGATGT CGACCTCGGC
AACCGCCGCC TCGTCATCGC CGACATAAAC CGACCACTCG ACGACCTCAC CTACCATGCC
GTCCTGGACT GGCTCGCCTA CCGCCGCGAC CGATGGCCCA ACACCGCCAA TCCCCATCTG
ATAGTCAATG GACAGACCGC GCTGGGACAC GGTCCCGTCA GCGACAGCTG GCTATCCCTG
ATCGTCCGAG GCCTGCCCGT CACCCTCGAA CAACTACGCG TCGACAGACA GCTCGACGAG
GCCCTCACCC ACGGCCCCGA CCCCCTCCAC CTCGCCGCCG TCTTCGGCCT CGACCAGAAC
ACCGCCATGC GCTACGCCAA CGCCGCCCGC CACCTCCTCG AGTCGCTCGC CGAGCGGCAC
ACTCCCGACG GTTCAGCAGG AACCCAAGGG TCAACCACCG GTCCAAGCAC CGACCGACCC
GCGAGTTCGC GCTGA
 
Protein sequence
MTWIRDGKPP TSCGDCLAWG VFGGRYCPAC ANFRQRFPRG TCAGCARLLP VKRGYCRLCW 
KQATLETAGR WAPAIQPFLE AVRHHQLFLA SLHHSVPVPA GRRLGKNSRN APGLTSPAEN
PGRAVAWVQL RLIDLPRDFT RFDRHNADLT NPLLVDARRR ARAMGEARGW TRRVHTDVDR
ALVILLSGLA PGEKVRYSDM FPALQARWIS VERTVQALDH LGLLDDDRQS TFDAYLEHKL
DGITPGIRRD VEDWIRTLYA GGPRTRAHSK NTAYGYLNEI KPTLLDWSTR FHHLREITGD
DIQKVISSVH GNKRDHTIVV LRSLFDHCKK TGTIFRNPVA RLRAGRKHYN LILPLHPERV
SMVLDAATSP AARLVVVLAG IHAARNKTTR HVQLDDVDLG NRRLVIADIN RPLDDLTYHA
VLDWLAYRRD RWPNTANPHL IVNGQTALGH GPVSDSWLSL IVRGLPVTLE QLRVDRQLDE
ALTHGPDPLH LAAVFGLDQN TAMRYANAAR HLLESLAERH TPDGSAGTQG STTGPSTDRP
ASSR