Gene Francci3_3680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3680 
Symbol 
ID3905364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4414980 
End bp4416350 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content71% 
IMG OID637881006 
Productputative pep2 protein 
Protein accessionYP_482761 
Protein GI86742361 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGACC ACCACGCGGA ACTGACCGGC CTGTTGCGCG AGTGGCTCCC CCGGCAGCGC 
TGGTTCGCCG GGAAGGGCAG ATCCGGGGGC GGCCTGCGGA TCCGTCACGA CGTCATGGTC
TTCGAACCGC TACGGCTGCT CGTCGTTGAC GTCGACTACG ACCACGGTCC GTCGGACCAT
TACCAGGTTC CGGTGGTCAT CCGCTCGGAC GCGCCCTTCG GCCACGAGGG CTTCCTGATC
GGCGAGAGTG CCGGCGGCCT GGTCTACGAC GGCCTGCACG ACCCCGATGG CAGCTCGGCC
CTGCTGAGTT TCCTCCGCCG GTCGACACAC CGCGACGGCC TCGTCGCCGA GGCGCTGGAA
CCACTGGACA TCCTGCCCGC CCACGCCGTC GGCGCCGAGC AGTCCAACAC CTCGATCGTC
TACGGCGACG CCTACATCCT CAAGGTGTTC CGGCGGCTCT GGCCGGGCGT CAACCCCGAT
CTGGAGGTGA CCCGGGCGCT GGCGGAGGCG GGCAGCACAC ACGTGGCCCG CCCCGTCGCC
TGGTACTCCG GCCGTGTCGA CGGGACACCG ACCACCTTCG GATTCCTCCA GGAGTACCTG
CGCAGCGGTA CCGAGGGCTG GCGACTGGCG CTGGCCAGCG TGCGCGACCT GTACGCCGAG
GCCGACCTGC ACGCCGACGA GGTCGGCGGC GACTTCGCCG CGGAGGCCGA GCGGCTGGGT
ACCGCCACCG CCGAGGTGCA CGCCGACCTC GCCCGGACGC TGCCGACGAG GCCCGCGACC
GCCGATGCCC TGCGCTCCCT GGTGACCTAC CTGCATGGCC GGCTGGATGC GGCGGTCGAG
GCCGTCGCCG AGCTGCGTCC CTTCGAGGCG GCGATCCGCA AGAGCTACGA CGAGGTGACG
GGGGCGTCGG CGCAGGCCGT CCATCCCCGG CCCTTCCAAC GGCTACACGG CGATCTGCAC
CTGGGTCAGG TCCTGCGGGT CGACTCGGGC TGGGTGCTGT TCGACTTCGA GGGGGAGCCA
GTCCGCCCGG TCGCCGACCG TGTGGGGCTG GAATCGCCGC TGCGCGACGT CGCCGGGATG
CTGCGCTCGT TCGACTACGC GGCCCGGTCG ATGCTGCTCG AACGCGGTGA CGAGCCGTCG
TTGGCCTACC GCGCCCAGGA GTGGGCAGAT CGCAACCGGG AGGCGTTCTG CCGGGGCTAC
GGTGCCGCGG CCGGCCGCGA TCCTCGTGCG GATGCCGTGG TGCTGCGTGC GTTCGAGCTC
GACAAGGCCG TGTACGAGAT CCTGTACGAG GCCCGCCACC GACCGGGGTG GATCGGGATC
CCGCTGTCCT CGGTGGAACG ACTGACCCGG TCCAGCCCGT CGGCGGGGTA A
 
Protein sequence
MGDHHAELTG LLREWLPRQR WFAGKGRSGG GLRIRHDVMV FEPLRLLVVD VDYDHGPSDH 
YQVPVVIRSD APFGHEGFLI GESAGGLVYD GLHDPDGSSA LLSFLRRSTH RDGLVAEALE
PLDILPAHAV GAEQSNTSIV YGDAYILKVF RRLWPGVNPD LEVTRALAEA GSTHVARPVA
WYSGRVDGTP TTFGFLQEYL RSGTEGWRLA LASVRDLYAE ADLHADEVGG DFAAEAERLG
TATAEVHADL ARTLPTRPAT ADALRSLVTY LHGRLDAAVE AVAELRPFEA AIRKSYDEVT
GASAQAVHPR PFQRLHGDLH LGQVLRVDSG WVLFDFEGEP VRPVADRVGL ESPLRDVAGM
LRSFDYAARS MLLERGDEPS LAYRAQEWAD RNREAFCRGY GAAAGRDPRA DAVVLRAFEL
DKAVYEILYE ARHRPGWIGI PLSSVERLTR SSPSAG