Gene Francci3_1420 details


Gene Information

Locus tag: Francci3_1420
Symbol:
ID: 3903401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1711502
End bp: 1712542
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 71%
IMG OID: 637878757
Product: hypothetical protein
Protein accession: YP_480526
Protein GI: 86740126
COG category: [R] General function prediction only
COG ID: [COG0325] Predicted enzyme with a TIM-barrel fold
TIGRFAM ID: [TIGR00044] pyridoxal phosphate enzyme, YggS family
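The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1711502
END_BP = 1712542


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1041 346
```

Both values agree with the Gene Length and Protein Length fields of the record.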


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0186857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0858519
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCACC GGCCCGAGAT GCCCGCGAGG CATGGCGCAT CCGGCTCGGG GGAGTCCGAC 
GGACCGTTCG ACCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCTGCCCCG
GACCGGCCTG CCCCGGACCG GCCTGCCCCG GACCGGCCTG CCCCGGACCG GCCTGCCCCG
GACCGGCCTG CCCCGGACCG GCCCGCCCCG GACCGGCCCG CCCCGATCGA GCTCGATCCG
GCGCGCCTGG ACCGGTTGAC GCAGCGGCTG GCCGAGGTCC GGGCTCGGAT CGCGGGGGCG
GCCCGGGCCG CGGGCCGTGA TCCGGACCAC CTCACCCTCA TTGCGGTCAG TAAAACCTAC
CCACCCCAGG ATGTTGTGAT GATGCACACG CTCGGGGTGC GGCACTTCGC CGAGAACCGG
GAGCAGGAGG CCGGGCCGAA GGTGAGTCTC GTCACCCGGC TGATCGGCGG GGAACGGAGC
GTCCCGGCCA AGGGAACCGG TGACGGCCTG TCCTCCGGTG CCACCGGTTC CGACGATCCG
ATCTGGCATT TCGTGGGACA ACTGCAGCGC AACAAGGCCA GATCCGTTCT TCGTTGGGCG
GATTGGGTGC AGTCGGTGGA TCGGGTGAGC CTGGTGCCAG TGCTCTCCCG GCTGGCAATG
GAACGCGGCC GCCCGCTGTC GATCTGTCTC CAGGTCTCGT TGGACCTCCC TGGTGCTTCC
GATGGGAAGA TCGGCGCGTC GATCGCCGGC TCGAGGCGCG GAGGGATCGA TCCGGCCGGC
CTTTCCGCCC TGGCCGATCT TGTCGAGGAG GCGCCGGGAC TGGCCCTGCG AGGCGTGATG
GCTGTCGCCC CCCGGAGGGG GCAGCCACGA CCTGCGTTCG CGCGACTGCG TGAGGTGGCG
GAACGCCTGA AGGTGGGGCA TCCCCAGGCC ACCGTCATCA GTGCCGGCAT GTCGGGAGAT
CTTGAGGACG CTGTGGCCGA AGGCGCGACA CACCTTCGGA TCGGCACCGC TTTGTTCGGT
GAACGGCCTG GTGTCCCTTA G
 
Protein sequence
MIHRPEMPAR HGASGSGESD GPFDPDRPAP DRPAPDRPAP DRPAPDRPAP DRPAPDRPAP 
DRPAPDRPAP DRPAPIELDP ARLDRLTQRL AEVRARIAGA ARAAGRDPDH LTLIAVSKTY
PPQDVVMMHT LGVRHFAENR EQEAGPKVSL VTRLIGGERS VPAKGTGDGL SSGATGSDDP
IWHFVGQLQR NKARSVLRWA DWVQSVDRVS LVPVLSRLAM ERGRPLSICL QVSLDLPGAS
DGKIGASIAG SRRGGIDPAG LSALADLVEE APGLALRGVM AVAPRRGQPR PAFARLREVA
ERLKVGHPQA TVISAGMSGD LEDAVAEGAT HLRIGTALFG ERPGVP
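
The gene and protein sequences above are printed in space-separated blocks of ten residues. As a minimal sketch, the helpers below (hypothetical names, not part of the record) strip that layout, compute a GC fraction, and translate a DNA string with the amino acid assignments of NCBI translation table 11. The sketch assumes an unambiguous A/C/G/T sequence read in frame from its first base; it does not handle alternative start codons or ambiguity codes such as N.

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... (bases in T, C, A, G order).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGATCCACC GGCCCGAGAT GCCCGCGAGG"))  # prints: MIHRPEMPAR
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the protein sequence and the GC content field of the record.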