Gene Franean1_2061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2061 
Symbol 
ID5670462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2484335 
End bp2485441 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content78% 
IMG OID641240983 
Productprotein of unknown function UPF0052 and CofD 
Protein accessionYP_001506404 
Protein GI158313896 
COG category[S] Function unknown 
COG ID[COG0391] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01826] conserved hypothetical protein, cofD-related 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.282535 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCG ACGCTCCCGC GATCGTCGCG TTCGGCGGCG GGCACGGCCT GGCCGCGTCG 
CTGGCCGCGC TGCGCCGGAT CACCCGCCAC CTGACCGCGG TGGTTACCGT GGGTGACGAT
GGTGGTTCGT CGGGCCGGCT GCGCGCCGAG CTCGGCGCCC TGCCCATGGG CGACCTGCGG
ATGGCCCTCG CCGCGCTGGC CGGCGCCGAC GAGTGGTCCC AGACGTGGGC CGACCTCTTC
CAGCACCGGT TCGGCGGCGG CGGGGCGCTC ACCGGCCACG CGGTCGGCAA CCTCGTGCTG
ACGGCGCTGG CGGAGCGCGC GGGCTCCCCG GTGGCCGCGC TCGACCTGGC GGCGTCGCTG
CTCGGCGTCG ACGGCCGGGT GCTGCCACTG TCATGCGCCG GCATCGACAT CGTCGCGGAC
GTGACCGGCC TCGACCCGAA CCGGGCCGGG GAGTCCGCGG AGGTCCGCGG CCAGGCGGCC
GTCGCGACGA CCCCCGGCCG GGTGGCCGGG GTACGCCTGG CCCCGGCGGA GCCGGCGGCC
TGCGGTGCCG CGCTCGCGGC CGCCGCCGCC GCCGACTGGA TCGTCCTCGG TCCGGGCTCG
CTCTACACGA GCGTGCTGCC GCACCTGCTC GTGCCCGACA TGCGCGCGGC GATCACGGGC
GCCGACGCCC GCCGGGTGAT GGTGCTCAAC CTCGTGGCGC AGCCGGGCGA GACGGCCGGC
TACACCCCCG AGGCCCACCT GCACGCCCTG GCCACGCACG TCCCGGGGCT GCGCCTCGAC
GTGGTGATCG CGGACCCGGC CGCGGTCGGC GACCCGGACC CGCTGGCCCG CGCGGCCGCG
GACCTGGGCG CCCGGCTCCA CCTCGCACCC GTGCGGGTTC CCGGTGAACC CGCACTGCAC
GACCCCGAAC GTCTCGCGGC CGCGTTCCGC GCCGTCTTCG CGCAGGACGG CGCCGTGGCG
GTGCCGTCGG CGGCCCGTCC GATGGACGGC CAGCGGGCCT GGCCTGGCCG GCAGCCGGGA
GCGGCCTGTG CCGACCCCTC GGGTGTCGGG TGCTCATCCA CCGGTGGCTC CGGCCACCGG
ATCCCCTCCG GTGAATGCAA GGAGTGA
 
Protein sequence
MTADAPAIVA FGGGHGLAAS LAALRRITRH LTAVVTVGDD GGSSGRLRAE LGALPMGDLR 
MALAALAGAD EWSQTWADLF QHRFGGGGAL TGHAVGNLVL TALAERAGSP VAALDLAASL
LGVDGRVLPL SCAGIDIVAD VTGLDPNRAG ESAEVRGQAA VATTPGRVAG VRLAPAEPAA
CGAALAAAAA ADWIVLGPGS LYTSVLPHLL VPDMRAAITG ADARRVMVLN LVAQPGETAG
YTPEAHLHAL ATHVPGLRLD VVIADPAAVG DPDPLARAAA DLGARLHLAP VRVPGEPALH
DPERLAAAFR AVFAQDGAVA VPSAARPMDG QRAWPGRQPG AACADPSGVG CSSTGGSGHR
IPSGECKE