Gene Francci3_1586 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1586 
Symbol 
ID3903721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1902072 
End bp1903406 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content70% 
IMG OID637878923 
Productglycosyl transferase family protein 
Protein accessionYP_480691 
Protein GI86740291 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.807446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.639173 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTGC ACGGGAATCG GGAGTCCCCG GCCGGCCCAG CCGCGGCCGA CGAGGGTATC 
TGGTGCTGTG AGCTCGAGGT GTCGGCCGGG GGCGTCGTGC GCTCGGTCGT GCCGCCACGC
GACCGTCAGC GACGTGCGCG GGTCCTCGCC CTGCTGCACG GTGAGCCGCT CGGCTACGCG
GCGGCCGCCA GGACGGCCAC CGGCATCGAC GTCGATGAGC TGCTCGGCAG GGTCTGGTCG
GAGTTCGGCG GGCGCATCAA CGATCACCTG GTCGGCGAGG GTCTCCGCCC GCTCGAGCGC
CTCGACGCCC GCACCAGGCC TGCCGCGGCC ACCGAGGCCT GCCCGAACAA GGTTCAATCC
ACCGATCTCG ATTCCACGGA TCTCGTCTCC GTCGTCGTCG CCACCAGAGA CCGCAGTGAG
ATCCTGGCCA ACTGCCTGCG TCGACTCACG GAGGTCACCT ATCCGGCCGT CGAGTTCCTC
ATCGTGGACA ATGCCCCGTC GAGCGACGCC ACGAAGCGGG TCGTCGACTC GTTTTCGGCC
ACCGACGAAC GATTCCGCTA TGTCCCCGAA CCACGCCCGG GTCTCTCCCG CGCCCGCAAC
CGCGGGCTCG CGCTGGCGCG GGGGGTGTAC GTCGCCTATA CCGACGACGA CGTCTCGGTT
GACCCCGGCT GGATCGACGG GCTGGTTCGG GGTTTCCGGC GCCGGCCGGA CGTTGCGTGC
GTCACGGGAC TCGTCTGCAC GGCGAGCATC GTGAGCGCGG CCGAAGTCTA CTTCGACGCC
CGGGCGTCGT ACTGGTCCAC CCGCTGCGAG CCGGTGCTTT TCGACCTCGC CGACAACGAG
CGGCACGGTC CGCTGTACCC CTATATAGGT TTCGTCGGCA CCGGCGCGAA TGTCGGGTTC
GACGTCGCGT TCCTGCGGGA CCTGGGTGGC TTCGACGAGG CGTTGGGAGC GGGCACCAGG
AGCCGGGGCG GCGAGGACCT CGATCTCTTC GTGCGCATGT TGCGGGCGGG CCGAGCGATC
GCGTACGAAC CAGCCGCCTT CGTCTGGCAT CACCACCGCG CCGACGACGC GGCGCTGCTC
GCTCAGATGT TCGGCTACGG CTCCGGTTTC ACGGCCTTCC TGGCCAAGCT GCTTCTCCAG
CGGTCGACCC GGGGTGAGGT GGTACGTCGC ATCCCACGAG GCCTGCGTGG GATGGCCCGC
ATCGGCCATG CGACGAGCCA GCGGCTCGAC GGGCGGGTCC AGGCCCCGAA GGGTGCCCTG
CTGCGGGAGT TCGCCGGCTA CGCCGCCGGA CCGTTGCTCT ACGCCCGGGA GCGGCAAACG
GCCGCCTGGC GGTGA
 
Protein sequence
MNLHGNRESP AGPAAADEGI WCCELEVSAG GVVRSVVPPR DRQRRARVLA LLHGEPLGYA 
AAARTATGID VDELLGRVWS EFGGRINDHL VGEGLRPLER LDARTRPAAA TEACPNKVQS
TDLDSTDLVS VVVATRDRSE ILANCLRRLT EVTYPAVEFL IVDNAPSSDA TKRVVDSFSA
TDERFRYVPE PRPGLSRARN RGLALARGVY VAYTDDDVSV DPGWIDGLVR GFRRRPDVAC
VTGLVCTASI VSAAEVYFDA RASYWSTRCE PVLFDLADNE RHGPLYPYIG FVGTGANVGF
DVAFLRDLGG FDEALGAGTR SRGGEDLDLF VRMLRAGRAI AYEPAAFVWH HHRADDAALL
AQMFGYGSGF TAFLAKLLLQ RSTRGEVVRR IPRGLRGMAR IGHATSQRLD GRVQAPKGAL
LREFAGYAAG PLLYARERQT AAWR