Gene Francci3_0721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0721 
Symbol 
ID3903511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp829084 
End bp830688 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content72% 
IMG OID637878054 
Productglycosyl transferase, group 1 
Protein accessionYP_479834 
Protein GI86739434 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.920452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.491923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC CCGCGAGAGC CGGCCAACCG GCCGAGAACG ACCAACCGGC CGACGGGGTG 
CGGGCGGCGG TCTACAACCG GTTCTGGCAC TCCATGGGCG GCGGGGAACG CCACAACGGC
ATGATCGCCC AGGTACTGGC CGCCGAGGGC GCCGTGGTCG ATCTCCTCGG GCACTCCGAG
GTCGACCTCG CGGCGCTCGG CGAGCATCTC GGGCTGGATC TCGCCGACTG CCGGTACGTG
CGGTTGCCGG ACCGGGGCGA GGAGGCCATC GCCGTCCTCT CCAGACGCTA CGACCTGTTC
GTAAACGGTT CGTACATGAG CCGGATCATG CCGAAGGCCC GCCACTCGGC GTACCTGTGC
TTCTTCCCGA CGCCGTTCGA CCACGACATG GCGGCCTGGC GCAAGGCCGC GGTCCGCACG
GCCGGGCCGC TGCTGCGCGG GGTTCGCCCA GCGGTGAGCT TCGGCCAGGG CTGGTACCCG
CCGGAGGGCG GCCGCCGCCG GCAGTGGACC TGGACGAACG GGGCGGGCAT CCTCGCCGTC
AGCCCGGGCA AGCGGCGCAC CCTGCGCGCC GATATCGGCC GGCCCGGCGC GCCCGTCGGT
ACGGCGTTGC GCCTCCTCGA CGCCGACGGC GGCGTGCTGG CCAAGCTGCG GATCGGCACC
GACTTCACCC CGTTCGAGGT ATCGCTGCCG CCGTCGGCGA ACGGTACCGA GCTCACCCTG
GTCAGTGACG CGTTCTCGCC AGGCGCGGCG GACGTGCGCG AACTCGGCGT GGCGGTGAGC
CGCCCCCGGG TCACCGATGG TGGCGAGGGG CCGATCGCCC GGCTCCCGCT GCGTTTCCCC
TGGCTGCTGC GGGACCCGGC CGACCTCGGC TACCTCGACG GCTACGACGT GGTCATGGCC
AACTCGCGGT TCACCCGCGG CTGGATCCGC CGGTTGTGGA AGCGCGACGC CGACCTGCTG
TTCCCCCCCA TCCAGGTGGA ACGGTTGCAT CCGGCGCCGC GGCGGGAGAA GGCCGTCGTC
ACCGTCGGCC GGTTCTTCGC CCCCGGCCTC GGCCACGCCA AGCGACAGCT GGAGATGGTG
CAGTGGTTCG GCGACCTGTA CCGTGCGGGC AACCTGCCCG ACTGGACGAT GCACGTCGTC
GGCGGCTGTG AGGACTCTCA ACTGCCCTAC CTGGAACAGG TCCGGGCGGC CGGCGCGGGG
CTGCCGGTGG AGATCCATCC CAATGCCCCG CGCGCCGAGG TCGAACGGCT GCTCTCGACC
AGCTCGGTGT TCTGGTCGGC CACCGGGTAC GGCGAGGACG ATGACCGGCG TCCCTGGACG
GCGGAGCACT TCGGGATGAC CACCGTCGAG GCGATGGCCG GGGGCTGCGT TCCCGTCGTC
ATCGACCGGG CCGGCCAGCG AGAGATCGTC CGGCACGGAA TCGACGGCTA CCGATGGACC
GGCCCGGAGC AGGTTGCCTC CTTCACTCGC CGGCTCGCCG CCGAGGACGG TCTACGCGGT
CGGCTCGCCG CCGCGGCGAT CGAACGCGCC CAGACCTTCT CCGATGCGGC GTTCGCGCGG
CAATGGCGGG AGATCGCCAT CCGGCACGGG TTGTACGAGC GGTGA
 
Protein sequence
MNDPARAGQP AENDQPADGV RAAVYNRFWH SMGGGERHNG MIAQVLAAEG AVVDLLGHSE 
VDLAALGEHL GLDLADCRYV RLPDRGEEAI AVLSRRYDLF VNGSYMSRIM PKARHSAYLC
FFPTPFDHDM AAWRKAAVRT AGPLLRGVRP AVSFGQGWYP PEGGRRRQWT WTNGAGILAV
SPGKRRTLRA DIGRPGAPVG TALRLLDADG GVLAKLRIGT DFTPFEVSLP PSANGTELTL
VSDAFSPGAA DVRELGVAVS RPRVTDGGEG PIARLPLRFP WLLRDPADLG YLDGYDVVMA
NSRFTRGWIR RLWKRDADLL FPPIQVERLH PAPRREKAVV TVGRFFAPGL GHAKRQLEMV
QWFGDLYRAG NLPDWTMHVV GGCEDSQLPY LEQVRAAGAG LPVEIHPNAP RAEVERLLST
SSVFWSATGY GEDDDRRPWT AEHFGMTTVE AMAGGCVPVV IDRAGQREIV RHGIDGYRWT
GPEQVASFTR RLAAEDGLRG RLAAAAIERA QTFSDAAFAR QWREIAIRHG LYER