Gene Franean1_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1610 
Symbol 
ID5670013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1926796 
End bp1928163 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content77% 
IMG OID641240529 
Producthypothetical protein 
Protein accessionYP_001505955 
Protein GI158313447 
COG category[S] Function unknown 
COG ID[COG5267] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00247546 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCACGCCA CCATGGTCGT CATGGCAGCC GCGGTCACCA CCGACAGCCC CACCGGGGTG 
CGGGCCGAGC TCGCGCTCCT GTACCGGCGG GCCGGCTTCG GGGCCCGTCC CGACGAGCTC
GACGCCGCCG AGCGGGCCGG CTACGAGGCG GCGGCCCGGG GCCTGCTCGA CGCGCACGAC
CGCCCCGACC CCGGGGCGGT ACCCCCGCCC GACCTGGACG CCGAGGCCGA CCGCCGGCCA
CCGCCCGCGT CCGACCGGAC CGCCCGCGCG GCGTACGCCC GCGGGCTGCG CGAGCGCACC
GAGGCGCTGA TCGGCTGGTG GCTCGGCCGG ATGGTCTCGA CCGCCGACCC GGTCACCGAG
AAGCTGACGC TCTTCTGGCA CGGCCACTTC GCGACGTCCA TCCAGAAGGT GCGCTCGCCG
GCGCTGATGC TCGCCCAGAA CGAGATCTTC CGGCGGTCCG GCCGCGGGCC GTTCGACGCG
CTCACCCTCG CCGTCGCGCG CGACCCGGCG ATGCTGATCT GGCTGGACGC CGGCCGCAAC
GTCAGGGAGC ACCCGAACGA GAACTTCGCC CGCGAGCTGA TGGAGCTGTT CACCCTCGGC
ATCGGCAACT ACGAGGAGCG CGACGTCCGG GAGGCGGCGC GGGCGTTCAC CGGCTGGCGG
GCCGGCCGCA CCGGCCGCTT CACCGTCGCC GCGGCGCAGC ACGACTCGGG CACCAAGACG
ATCCTCGGCC ACACCGGTGA TCTCGGCGGT GAGGACGTCG TCAGCCTGCT CGCCCACGAC
CCGGCCTCGC CGCGCTGGGT GTGCTCCCGG CTGTGGAGCC GGTACGCCGC ACCGGTGGCA
CCCGCCGACC AGGCGCTCAC TCCGCTGCTC ACCGCCTACG GGCCGGGGCT CGACACCGGC
GCGGCGCTGC GGGCGATGCT CCTCGCGCCC GGCTTCCAAG CCACCGCCGG GCGTCTGGTG
GCGCCGCCGA TCGACTACGT GGTGGGCGTG CTCCGCGCGC TGCGGATCAC CGTCCCGGCG
CCCGGGACGG GCGGTGCCGC CGCGCCGGCG CGCCTGGCCA CAGTGCTGCG CGGGCTCGGC
CAGGTCCCGT TCGCGCCGCC CAGCGTCGGC GGCTGGCCGG CGGGCGGCGC GTGGCTGACG
ACCGCGGCGA GCCTGGTCCG GACCCGCTTC GCGGCGGCGG TCGTGGACGC GGCCGACATC
TCGGCCGTGG CGGACGTCCC GGCCGCGGAG CGGGTGGACG CCGTCGCCCG GCTGTTGTCC
GTGCCGGCAT GGAGCGCGCG CACCCGCTCC GCGCTGTCCG CCGCGGCCGC CGACCCGCCC
CGGCTCGTCA CCCTCGCCCT GCTCGCCCCG GAGTACCTCG TCACCTGA
 
Protein sequence
MHATMVVMAA AVTTDSPTGV RAELALLYRR AGFGARPDEL DAAERAGYEA AARGLLDAHD 
RPDPGAVPPP DLDAEADRRP PPASDRTARA AYARGLRERT EALIGWWLGR MVSTADPVTE
KLTLFWHGHF ATSIQKVRSP ALMLAQNEIF RRSGRGPFDA LTLAVARDPA MLIWLDAGRN
VREHPNENFA RELMELFTLG IGNYEERDVR EAARAFTGWR AGRTGRFTVA AAQHDSGTKT
ILGHTGDLGG EDVVSLLAHD PASPRWVCSR LWSRYAAPVA PADQALTPLL TAYGPGLDTG
AALRAMLLAP GFQATAGRLV APPIDYVVGV LRALRITVPA PGTGGAAAPA RLATVLRGLG
QVPFAPPSVG GWPAGGAWLT TAASLVRTRF AAAVVDAADI SAVADVPAAE RVDAVARLLS
VPAWSARTRS ALSAAAADPP RLVTLALLAP EYLVT