Gene Francci3_3941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3941 
Symbol 
ID3906900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4718124 
End bp4719152 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content67% 
IMG OID637881268 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_483020 
Protein GI86742620 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.438556 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGC TCGTGACGGG CACGGAGGGG TATCTGGGGT GCCTGCTCGC GCCCGAGCTG 
CTGCGTGATG GCCATGAGGT CATCGGCGTG GACACCGGCT ACTACAAGTA CGGCTGGCTG
TACCGCGGCG TCGACCGGAC CCCGTTGACC CTTGACAAGG ACCTTCGCCA TCTCACGGTC
GAGGACTTCG CGGGGGTCGA CGCCGTCGTG CACATGGCGG AGCTGTCCAA CGACCCGCTC
GGCGCGCTCG CCCCGGACGT GACGTACAAG GTGAACCACG TCGGCTCGGT CCGGCTGGCG
AAGCTGGCCA AGCAGGCCGG CGTCGAACGG TTCGTCTACA TGTCCTCCTG CAGCGTCTAC
GGCGTCGCGA CCGGTGTGGA CGTCACCGAG GCCTCGCCGG TGAACCCGCA GACCCCCTAC
GCCGAGTGCA AGGTCTACGT GGAGCGTGAC GTCGCCCCGC TGGCGGACGA CACCTTCTCG
CCGACCTTCC TGCGCAACGC CACCGCCTAC GGTGCCTCCC CGCGGCAGCG TTTCGACATC
GTGCTCAACA ACCTGGCCGG GGTGGCCTGG ACTACCGGCG AGATCGCGAT GACCTCGGAT
GGCACCCCGT GGCGCCCGCT GGTCCACGGG CTCGATATCG CGAAGGCGAT CCGCCTGGTG
CTGACCGCAC CGCGCGACAT CGTGCACAAC CAGATCTTCA ACGTCGGCGA CAGCGAGCAG
AATTACCAGG TGAAGGAGAT CGCGGACGCG GTCGCCACGG TGTTCACCGG CTGCACGCTG
AGCTTCGGTG ACAACGGCGG TGACAATCGC AGCTACCGGG TGTCGTTCGA CAAGATCGCC
TCCACCCTGC CCGGCTTCTC CTGTGACTGG AACGCGCTCA GGGGCGCCCA GCAGCTGCAC
GACGTCTTCA CCCGTATCCA GCTCGACAAC GAGACGTTCA CCGGCCGCGG GCACACCCGG
CTCAAGCAGC TTCAGTACCT GATCCGCACC GGCCAGCTCG ACGCCGACCT GTTCTGGGCC
CACTCGTGA
 
Protein sequence
MKVLVTGTEG YLGCLLAPEL LRDGHEVIGV DTGYYKYGWL YRGVDRTPLT LDKDLRHLTV 
EDFAGVDAVV HMAELSNDPL GALAPDVTYK VNHVGSVRLA KLAKQAGVER FVYMSSCSVY
GVATGVDVTE ASPVNPQTPY AECKVYVERD VAPLADDTFS PTFLRNATAY GASPRQRFDI
VLNNLAGVAW TTGEIAMTSD GTPWRPLVHG LDIAKAIRLV LTAPRDIVHN QIFNVGDSEQ
NYQVKEIADA VATVFTGCTL SFGDNGGDNR SYRVSFDKIA STLPGFSCDW NALRGAQQLH
DVFTRIQLDN ETFTGRGHTR LKQLQYLIRT GQLDADLFWA HS