Gene Franean1_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1778 
Symbol 
ID5670180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2135861 
End bp2136772 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content76% 
IMG OID641240699 
Producthypothetical protein 
Protein accessionYP_001506122 
Protein GI158313614 
COG category[R] General function prediction only 
COG ID[COG1090] Predicted nucleoside-diphosphate sugar epimerase 
TIGRFAM ID[TIGR01777] conserved hypothetical protein TIGR01777 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00178055 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185841 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCG CCGTCACCGG CTCGTCCGGG CTGATCGGTT CGGCGTTGCT GCCCGCGCTG 
CGCGGGGACG GCCACGAGGT CGTCACCCTC GTCCGGCGCC CGCCGCGCGC CCCGTCCGAG
ATCCGCTGGG ACCCGGCGGC CGGCACGCTG GACGCCGCCG CCCTGGCCGG CGTGGACGGC
GTCGTGAACC TGGCCGGCGC CGGCATCGGC GACCGCCGGT GGACCGCCGC CTACAAGCAG
ACCCTTCGGA CCAGCCGCAT CGACGGCACC CGCCTGCTCG CCGAGGCCCT CGCCGGCCTC
GACCCGCGCC CGCGGGTCCT GCTCTCCGGC AGCGCCATCG GCTGGTACGG CACGAACGCC
GGCTCGGCCG GCGCCGCGCT GGACGAGACG GCCCCGCCCG GCACCGGCTT CCTCGCCGAG
CTCGCCCGTG ACTGGGAGAA CGCGACCACA GCGGCGCAGG AGGCCGGCAT CCGGGTCGTC
CGGGTGCGCA CCGGCATCGT CCTCTCCGGG CGCGGTGGGA CCCTCCAACG GCTGCTCCCG
ATCTTCCGGC GCGGAGCCGG CGGCCGGCTG GGCTCGGGGC GCCAGTGGCT GAGCTGGATC
AGCCTGGCCG ACACCGTCGA CGCGCTGTGC TTCCTCCTCG AGGCCGACGG AGTACGCGGG
CCGGTCAACC TGGTGGCGCC CACCCCGGTG ACGAACGCGG AGTTCACGTC GGCGCTGGCG
CGGACGCTAC GGCGCCCGGC CTTCGCCCAG GTACCGCGCT TCGCACTACG CCTGGCCCTG
GGCGAGTTCG CCGACGAGGG ACCACTCGCC TCCCAGCGGC TCGCGCCGGC CACGCTGGTC
GACGCCGGGT TCCGGTTCAA CCACTCCGAC CTCGCCACCG CGCTGGCCGA CGCCGTCCAC
CGCGACGCCT GA
 
Protein sequence
MKVAVTGSSG LIGSALLPAL RGDGHEVVTL VRRPPRAPSE IRWDPAAGTL DAAALAGVDG 
VVNLAGAGIG DRRWTAAYKQ TLRTSRIDGT RLLAEALAGL DPRPRVLLSG SAIGWYGTNA
GSAGAALDET APPGTGFLAE LARDWENATT AAQEAGIRVV RVRTGIVLSG RGGTLQRLLP
IFRRGAGGRL GSGRQWLSWI SLADTVDALC FLLEADGVRG PVNLVAPTPV TNAEFTSALA
RTLRRPAFAQ VPRFALRLAL GEFADEGPLA SQRLAPATLV DAGFRFNHSD LATALADAVH
RDA