Gene Francci3_4345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4345 
Symbol 
ID3907316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5186038 
End bp5187381 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content72% 
IMG OID637881675 
Productglycosyl transferase, group 1 
Protein accessionYP_483420 
Protein GI86743020 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGAGT CATTGGGATC TTTCCAGGCG CTGTGCATTC TCTTCTTCTG CGAGCAGTAC 
CCGCCCGTCG TCTGGGACGG CGCGGGCACC TACACCGCGG CGCTGGCCAC GGCACTCGCG
AAGCTCGGCC ACGACGTCCA CGTCCTGTGC GCCCAAGGGC GTCATGTCGT CGACTCGGTG
GAGGACGGGG TGCACGTCCA CCGGCGCCCG TTACTACGGG TGCCGGTCAG CCGCGGGCTG
GGAAAGCTCG GCGCCCAACT CCGGGGACCC CTGTACCCGC GTGACTCGCT GGCGCTGCGG
GCCAGCCTCC CGCTGTCCTA TAGCTTGTGG ATGCGTCAAC TCGGACTCAG ACCAGACGTG
ATCGAGACCC AGGACGGGGA AACCCGCGGC CTGATCCAGG CCATACGTCG CTCCACGCCG
CTCGCCATCC ACCTCCACTG CCCGACCATG CTCACGGTCA GGCTTTCCGG CCAGCCGGTC
GGGTTGAAGG GCCTGGTCGC CGACCGGCTG GACCGGACAT CGGCGAGCCT GGCCGCCGTG
GTCACCTCAC CGTCGCAGCT GCTCGTCGAC ACGCTGCGGG AGCAAGGCTG GTTGGGTGAC
CGTGAGGTGG AGGTGATTCC CAATCCGTTC GACGCCGAGC CGTGGCTCGA CATACCCGAC
GTCGCCGACA CCACGCCGAC AGTCGCCGTC GTGGGTCGAC TCGAGGCCTA CAAGGGCGTG
GACGTGCTGC TCGACGCGTC GGCCCGCCTT CGTGGCGCGG GGGTGGCGCA CCGGCTGGTG
CTCGCGGGAC GCTCGGCGGG TGAGATCAAC GGGGTGCCCT CGGGCGAGTG GCTGGCCCGC
CGGGCCCGCC AGCTTGAGCT GGACGTCGAG TTCACCGGGC ACCTGTCCGG TTCTCAGATC
CGCGACGTCT ACCGGCGGGC CCGTGTCGTC GCCGTGCCGA GCCGCTTCGA GAGCTTCTCG
ATCGCGGCCG TCGAGGCGAT GGCGGCCGGC CGGCCGGTGG TGACCACGAC CCGGGCCGGA
GTCGCGCCCT TCGTCGAGCG GTGGGAGGCC GGATCCTCGG TACCGGCCGG CGATCCGGTG
GCGCTGGCGG ATGCGCTGGC GCCGTTCCTG CTTGACCCCG CGCGGGCGAG CGCGTTCGGG
GCACGGGGTC GCCTCGGCGT GGTCGAGATC GAGCCCTCGG CCATCGCTCG CCGCAAGGTC
GAGGCGTACC AGCGCGGCAT CGCCCACTTC GACACCCGCA ACCGGCAGCA ACACCCCCGG
CCGAGCCACG CGGCCGCCGA TTTATACCCC GACCTGTCCC GGGAACGGCG CCGCCACCGG
CACTGGCCCG CAGCCCGGCC CTGA
 
Protein sequence
MAESLGSFQA LCILFFCEQY PPVVWDGAGT YTAALATALA KLGHDVHVLC AQGRHVVDSV 
EDGVHVHRRP LLRVPVSRGL GKLGAQLRGP LYPRDSLALR ASLPLSYSLW MRQLGLRPDV
IETQDGETRG LIQAIRRSTP LAIHLHCPTM LTVRLSGQPV GLKGLVADRL DRTSASLAAV
VTSPSQLLVD TLREQGWLGD REVEVIPNPF DAEPWLDIPD VADTTPTVAV VGRLEAYKGV
DVLLDASARL RGAGVAHRLV LAGRSAGEIN GVPSGEWLAR RARQLELDVE FTGHLSGSQI
RDVYRRARVV AVPSRFESFS IAAVEAMAAG RPVVTTTRAG VAPFVERWEA GSSVPAGDPV
ALADALAPFL LDPARASAFG ARGRLGVVEI EPSAIARRKV EAYQRGIAHF DTRNRQQHPR
PSHAAADLYP DLSRERRRHR HWPAARP