Gene Francci3_0621 details

Gene Information

Locus tag: Francci3_0621
Symbol:
ID: 3903489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 704904
End bp: 706508
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 75%
IMG OID: 637877954
Product: hypothetical protein
Protein accession: YP_479734
Protein GI: 86739334
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
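
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that check in Python, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(704904, 706508)
print(length)                  # 1605
print(protein_length(length))  # 534
```

Both values agree with the Gene Length and Protein Length fields of this record.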


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0172288
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.797481
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGCAATG CCCATACGGT CGAACAGGTT CGAGGTGCCG AGGCGCCGCT GCTCGCCTCG 
CTGCCCCCCG GCGCTCTGAT GCAGCGGGCG GTGCACGGGC TCGTCGCGCA CGCCGCACGA
CGCCTCGGGC GGGTCTACGG TGCCCGCGTC GTCGTGCTCG CCGGCGGTGG CGACAACGGT
GGTGACGCCC TCTGGGCCGG AGCGCGGCTG GCCGCCCGAG GGGCGCGGAT TGATGCGGTG
GCACCCGGCC GCACCCACCC GGAAGGCACC GCAGCCCTGC TTGCGGCCGG TGGCCGACTG
CACCATCCGG GTCCGGTGGT GGGCGCGGGC GGTCCGGCGG GTGCGGGGCA GCCGATCGAC
GATGAGCGGC TGCGTATCCT GTTCGCCGCC GCCGATCTGG TGCTGGACGG GTTGCTCGGC
ATCGGGGGCC GGGGCGGGCT GCGCGAGCCG TACGCCCGGC TCGCCGTTCT GGCACCCCCC
GAACGGACGG TGGCGGTCGA CGTCCCCAGC GGGGTGGACG CGGACACCGG GGCCGTCGCG
GGGCCGGCGG TCCGGGCGAG CAGTACCGTC ACCTTCGGTA CCCGCAAACG CGGCCTGTGG
CTCATGCCCG GTGCCGCCCA CACCGGCCCC GTCGAGCTGG TCGACATCGG ACTGGACCTG
CCTGAACCCG ACCTGTGCGC TCTCGACGAC GCCGACGTCG CCGCGGCATT GCCCGTTCCC
AGCCCGACGG CGTCCAAGTA CAGTCGCGGG GTCCTCGGCC TCGTCGCCGG CAGCGACGCC
TATCCGGGCG CCGCGGTGCT CGCGGTCGGC GGCGCGCTGC GTGGTGGCGC CGGGTACCTG
CGGGTAGTGA CCGCGGGGCA CGCCGAGAAG GTCGCGGGCG AGGCCGTCGG CGCGGTCGTG
AGCAGGGCCG GCGATTTCGT GCGGATGGCG CATCCCGAGG CCGTCGTGAC CGTCATCGAG
GCCGGTGACG CCGACACGAT GCTGGCCGCC GGCCGGGTCC AGGCCTGGGC GATCGGCCCA
GGCCTGGCGC CGGGACCCGC GGTCCGTACC CTGCTGACGG CGTTGCTGGC GACGGACCTT
CCGGTCCTGG TCGACGCGGG GGGCCTCGAC CCGCTTGCCG AGATCATCGC GGCGCGGCCC
GCCGTCGCCG AGCGGGCGGC GCCGGTGCTG ATCACACCGC ATGAGGGCGA GTTCCAGCGG
TTCGTCTCGG TCGCCCTTGG CCGGGACGCG CAGGCCACCG TCGCCGAACT CGCCGACGAT
CGGCTCGCTG TTCTCCGGCG GGCCGCGGCG GCGACGGGAG CGGTGATCCT GCTCAAGGGC
GCCCGCACAC TCGTCGTGCG GCCGGATGGA TCGGCGCTGG TCAACACGAC CGGATCGGCA
TGGCTGGGGA CCGCCGGGAC CGGTGACGTG CTGACCGGCC TGATCGGCTC GCTGCTCGCC
GCCGGGCTCG CCCCCGCGAC GGCCGGGGCG GTCGGCGCCT ATCTGCACGG CCGGGCCGCC
GAGCGGGCCC CGGCGCCGCT GGCAGCGAAT GATCTGCCCG CCCTCCTGCC CCAGGTCATC
GGCGACCTGC TTGACCGGCG ACGGATGCCA CGCGGAGTAG CGTGA
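
The GC content field in the Gene Information table can be recomputed from the gene sequence above. The following is a minimal sketch in Python, assuming the sequence is supplied as text in the same layout as this record (space-separated blocks of ten bases across several lines). The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Final line of the gene sequence, copied as formatted in the record.
last_line = "GGCGACCTGC TTGACCGGCG ACGGATGCCA CGCGGAGTAG CGTGA"
seq = clean_sequence(last_line)
print(len(seq))  # 45
print(round(gc_percent(seq), 1))
```

Passing the complete gene sequence to `clean_sequence` and then to `gc_percent` would give the whole-gene value to compare against the reported 75%.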
 
Protein sequence
MRNAHTVEQV RGAEAPLLAS LPPGALMQRA VHGLVAHAAR RLGRVYGARV VVLAGGGDNG 
GDALWAGARL AARGARIDAV APGRTHPEGT AALLAAGGRL HHPGPVVGAG GPAGAGQPID
DERLRILFAA ADLVLDGLLG IGGRGGLREP YARLAVLAPP ERTVAVDVPS GVDADTGAVA
GPAVRASSTV TFGTRKRGLW LMPGAAHTGP VELVDIGLDL PEPDLCALDD ADVAAALPVP
SPTASKYSRG VLGLVAGSDA YPGAAVLAVG GALRGGAGYL RVVTAGHAEK VAGEAVGAVV
SRAGDFVRMA HPEAVVTVIE AGDADTMLAA GRVQAWAIGP GLAPGPAVRT LLTALLATDL
PVLVDAGGLD PLAEIIAARP AVAERAAPVL ITPHEGEFQR FVSVALGRDA QATVAELADD
RLAVLRRAAA ATGAVILLKG ARTLVVRPDG SALVNTTGSA WLGTAGTGDV LTGLIGSLLA
AGLAPATAGA VGAYLHGRAA ERAPAPLAAN DLPALLPQVI GDLLDRRRMP RGVA
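
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation in Python. It relies on the fact that table 11 assigns the same amino acid to every codon as the standard genetic code and differs only in which codons may act as start codons, so the standard codon table is used here. The names `CODON_TABLE` and `translate` are hypothetical, and translation simply stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCGCAATG CCCATACGGT CGAACAGGTT"))  # MRNAHTVEQV
print(translate("CGCGGAGTAG CGTGA"))                  # RGVA
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.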