Gene Franean1_4853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4853 
Symbol 
ID5673193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5819591 
End bp5820856 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content75% 
IMG OID641243708 
ProductTPR repeat-containing protein 
Protein accessionYP_001509124 
Protein GI158316616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.712178 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.175346 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAG ACGACAACGC CCCCACGGGC ACCGACACGT ATGCCGGCTG CGCGCGCCCG 
CAGCCTGTCG GGGTGTTCCC GCTGCCCGCG GGATTCCTGC TGGTCCCTGG CGGCGAGACG
ACGTCCGGCC TGCGCCGCAC GCTGGCGGAT GGACGGCTGC CCTCGGCGTG GCCGCCGGAG
CTGGCCGCGC TCGAGCTCGC CTACCGCGGC AACGTGATCG CGGCGGTCGA GCTGCTACGC
GGCGACGACC CGGTCACCCG CTACAACCGG TTCGTGCTCC GCCCCGCGGG CCGGCCGCCG
GAGGACCCGG CCGCGCTGCG CGAGGCGCTC GGCGCCGAGC TGGGCATCCT GGTGGACGTC
GTCCGCTTCG CCCTCGGCGA GCTGGACGAG CCGCCGCCGC TGACCGACGA GACCGGCGAG
ATCGCGGCCC TGGTGTGGTC CGCGCACGCC GCGCACGCGA TGGCCGCCGG CCGGCTCGAC
GACGCCGCGC GGATGCTGGG CGAGGGCATC GCCGCCGCCG CCGATCCCTC CCCCGGGCTG
GCCGCCCAGC TCATGGCGAC GGCCGCGGGC CTGTGCCGCG ACGCCGAGGG ACCGAGTCCC
GCGGTCGCCA CCGACCTGAC CGCCGCGCTG GCGGCACTGG AGCACACCGA CCTGGCCACC
GGGCGGGCCG AGCTGCACCT GACGCTCGGC TCGGTCCACC ACGAGCTGGC CGGAGACGAC
CTGGCCGGCC TGCGCACCGC CGCCGAGCAC TATCTGGCCG CGCTGCGCCT GATCACCGTC
GACACCGCCC CGGAGCTGTT CGCGTCCGCG CAGGTCAACC TGGCCGCCGC CTACCTGGCC
ATGCCGATGA ACTCCGCGTC CGACCAGCTG CGGATCGGGG TCGCCATGCA GGGCCTGCGC
ACCGCGTTGA CCGTGTACAC CCGCCAGTCC TATCCCGAGC AGTGGGCGAG CACCCAGCTG
AGCCTGGCGA ACGCGCTCGT CTACGCGCCG TCCGCGCACC GGCGGGACAA CCTGGTCGAA
GCGGTCGGCC GCTACCAGGA GGTCATCACG ACCCGGGAAG GCCTGGCCGA CCCGCTCAGC
TACGCCCGCG CGCTGGCCAA CCAGGGCAAC GCGCTGGCAC ACCTGGGAGC CTTCCCGCAG
GCGACGGCGG TGCTGCACGA GGCCCGCTCG ATCTTCGAGC AGAACGGCGA GACCGGCGCC
GCCGCCGCGG TACGGGAGGT GCTCGACGAG ATCGCCCGCC ACGTCGCGAT GAGCCGGGCC
ACGTGA
 
Protein sequence
MAADDNAPTG TDTYAGCARP QPVGVFPLPA GFLLVPGGET TSGLRRTLAD GRLPSAWPPE 
LAALELAYRG NVIAAVELLR GDDPVTRYNR FVLRPAGRPP EDPAALREAL GAELGILVDV
VRFALGELDE PPPLTDETGE IAALVWSAHA AHAMAAGRLD DAARMLGEGI AAAADPSPGL
AAQLMATAAG LCRDAEGPSP AVATDLTAAL AALEHTDLAT GRAELHLTLG SVHHELAGDD
LAGLRTAAEH YLAALRLITV DTAPELFASA QVNLAAAYLA MPMNSASDQL RIGVAMQGLR
TALTVYTRQS YPEQWASTQL SLANALVYAP SAHRRDNLVE AVGRYQEVIT TREGLADPLS
YARALANQGN ALAHLGAFPQ ATAVLHEARS IFEQNGETGA AAAVREVLDE IARHVAMSRA
T