Gene Franean1_6456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6456 
Symbol 
ID5674771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7849889 
End bp7850866 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content76% 
IMG OID641245304 
Productheat shock protein DnaJ domain-containing protein 
Protein accessionYP_001510699 
Protein GI158318191 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0484] DnaJ-class molecular chaperone with C-terminal Zn finger domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00677011 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGTGATA CGCCAAAAAG CCAGTCGGCC CAGTACTTGC GGGCCTCGAT GTACGAGGTG 
CTCGGGATCG CACCCACGGC TTCGGACGAG GAGGTCCATG CCGCCTATCG GCGCGTGGTG
AAACGCGCCC ATCCGGACGC CGGCGGATCC CAACGCGCGT TCCTCCGGGT GAACGCGGCG
TACCGGGTGT TGAGCGACCC CGGTATGCGG CGAGCCCACG ATCTGTGGCT CGCCCATCTG
CTCGACGCAT ATGACCAGCC GGGACGCTCC GGCGGCGCGC GCCCGCCCGG CGGGCGGGCC
GCGCCCGGCG GGCGCCACCC CGCCGACGGG CGTCCCGGAT CCGACGGACG AACGAACCCG
GGCGGGCGCA CGTCGCCCGG CGGGCAGGCG GACAACCCCG GACGGGGGGC TTCCGGCAAT
CGGGGCACCT CGGATGACCC GGCACCCCCC TCGGGGGGTG CCGCCCCGGG CCGGCGGGGA
GCGTCCGGCC GGGCCTCCGA TCAGAGGGGG CCAGGCAAGG GCACGGACTC CGCCAGCGGA
TGGGGCGAGG CCGGTGGCTG GGCCGCCACC GGTGGCTGGG GCGATGCCAC TGCCGCTCCC
CTACCCGAGG ATGGCCGGGC GTCCGGCGGG CGGCGACGCT CGCGCCGGCG CCCACCAGCC
GACGCCGCCG AGTGGGTCGT GGGGCCCGAC CAGGCCATGA CACCGGACGG CGGCCCAGCC
GGGCCGCCCC CGTACGAGGC ACCCGGCGGG GCCGCGACCT GGGCCACCTG GCCGGACGAG
GACTACCCCA CGCGGGGTCC CGGCCGGCGG GCACGGCGCA GGTACCTGGT CTCGATGGCG
CTGTGCCTGG CCCTGTTCGT GCTGGCGGGC GCGGTGGTGC GGCTCTACTC CGTCCCGGTG
GCGATGGGCA TGATGCTGGC CTCGATGGTG ATCCCGCCGG TGGCGGTCCT CGCGGTCAAC
GCCGCACGCC GCCGCTGA
 
Protein sequence
MRDTPKSQSA QYLRASMYEV LGIAPTASDE EVHAAYRRVV KRAHPDAGGS QRAFLRVNAA 
YRVLSDPGMR RAHDLWLAHL LDAYDQPGRS GGARPPGGRA APGGRHPADG RPGSDGRTNP
GGRTSPGGQA DNPGRGASGN RGTSDDPAPP SGGAAPGRRG ASGRASDQRG PGKGTDSASG
WGEAGGWAAT GGWGDATAAP LPEDGRASGG RRRSRRRPPA DAAEWVVGPD QAMTPDGGPA
GPPPYEAPGG AATWATWPDE DYPTRGPGRR ARRRYLVSMA LCLALFVLAG AVVRLYSVPV
AMGMMLASMV IPPVAVLAVN AARRR