Gene Francci3_0319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0319 
Symbol 
ID3903351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp371215 
End bp373602 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content72% 
IMG OID637877648 
Productglycosyl transferase family protein 
Protein accessionYP_479435 
Protein GI86739035 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCCA TACTCACCGA CCCGGTGCCC CTCTCCGGAG GCGGTCGGGC CCCTGGACAT 
CGGCGGCGCA TGCCGTGGAC CAGGGACCAG CCCCACCACG TGCCCTCCCC CGACGCCCCC
ACCGAGACCG GGCTCCCGGT CGGCGGCTCC GGTGGCCTCG TATCCGCCGA AGACGTATCC
GCCGAGGACG CCCGCTGGGT CCGACCTGCA CTGGTCACCC TGCTGGTGGG CACGGGTGTG
CTCTACCTGT GGGGCCTCGG CGCGTCGGGA TGGGCGAACG CGTTCTACTC GGCCGCGGTC
CAGGCCGGCT CGGTGAGCTG GAAGGCGTTC TTCTACGGTT CCTCGGACGC CGCCAACTCA
ATCACGGTCG ACAAGCCGCC GGCCTCCCTG TGGGTCATGG CGCTGTCCGT CCGGATCTTC
GGCCTCAGCG CCTGGAGCAT CCTCGTGCCC CAGGCGCTGA TGGGTGTCGC CACCGTCGGC
CTGCTTTACC AGACCGTGCG GCGGCAGTTC TCCGCCGGTG CGGGCCTGCT CGCCGGCGCG
GTGCTGGCCC TGACCCCGGT CGCGGCCCTG ATGTTCCGGT TCAACAACCC GGACGCCCTG
CTCGTGCTGC TCCTGGTCGC CGCGGCCTAC GCGACGTTGC GGGCGATCGA GCAGGCGAGC
ACCCGCTGGC TGGTGTTCGC CGGGGTCCTG GTCAGCTTCG CCTTCCTGAC GAAGCTGCTG
CAGGCACTGC TCGTCGTCCC CGTCTTCGCG CTGGTGTACC TGGTGACCGC GCCGACCTCG
TTCTGGCGGC GCGTCCGGCA GACCCTGGCC GCGGGGCTCG GACTGCTGAT CCCCGCCGGC
CTCTTCATCG CGATCGTGGA GCTCGTTCCG GCGTCCTCCC GCCCCTACAT CGGGGGTTCA
CAGCACAACA GCCTGCTGGA GCTCACCCTC GGCTACAACG GACTCGGCCG ACTTACCGGC
AACGAGACCG GCAGCGTTGG CGGTGGCCGG GGCGGCCGGA TGGGCGGCCT GCCGGAATCC
ATCGGGAACG GCGCGGCAGG CGCCGCTCCC GGCGGCGGCA CGGTCATCTT CGGCCCCGGG
GGTCCGGGCG GCCACGGGGG CGGCATGTGG GGCCAGGCCG GTTGGACCCG CATGTTCGGC
TCGGAAGTCG GCGGTCAGAT CTCCTGGCTG CTCCCGACGG CACTGATCCT GCTCGTCGCC
GGCCTGTGGA TCACCCGGCG CGCCCCGCGG ACGAACCGCG CCCGCGCCGC CTTCCTTCTC
TGGGGCGGCT GGCTGCTCGT CACCGGCATC GTCTTCAGCC AGATGAAGGG CATCTTCCAC
GCCTACTACA CGGTGGCGCT CGCCCCGGCG GTCGGTGCCC TGGTGGGGAT GGGCTGCGCG
CTGCTCTGGC GTCATCGCCG GCATCCCGCC GCCGCCGTGG TCTCGGCTGC CACGATGGCC
GCCACCGCCG CCTGGTGCTA CACCCTGTTG AACCGCACCC CCCAGTGGCA TCCATGGATC
CGGTACGCCG TGCTCGTCGC CGGGATCATC GCGGCGCTCG GGTTCCTCGC GGCCATCCGA
CTGCCGCGGC GGGCCGCGCT CGGTGTCGCC ACCGTGGCGC TGGTCGCCGG GCTCCTCGGA
CCCGGCTCGT ATGCCATCGC CACCGCGGCG ACCCCGCACA CCGGCTCGAT CCCGGCGGCG
GGTCCCGCCG GCGCCGGCTT CGGCGGGCCG GGTGGCGGCC GCTTCTTCCG CATGGGCCAG
GGCGGCCTGC AGGCCGGCCG TCAGCGGGGC TTCCCGCCGC GGGGGATGGG CCGCGGGGCA
CAGGGCGGCC CACAGGGTGG TGGAATCTTC CCCGGCGGGG CCTTCCCCGG CGATGGGCAG
CAGATGCCCG GCGGCCAGGG CGGCATGCCC GGCCAGGCCG GTGGTGGTGC CGGGACCGGA
ACCACCACGC TGGGGAACGG GATCCTCGGC GGGCGGGAGA GCCGGGGCGG TGGCCCAGGC
GGTCTACTCA GCGCCGTCAC TCCCAGCAAC ACGATCGTGA AGCTGCTCAG GCAGAACGCG
GACTCCTACA CCTGGGTCGC CGCCGCGGTC GGTTCGAACA GCGCGGCCGG ATACCAGCTC
GCGACCCAGG ACCCGGTCAT GCCGGTCGGC GGATTCAACG GGAGCGACCC CTCGCCCACC
CTGGCGCAGT TCAAGCAGTA CGTCGCCGAA GGAAAGATCC ACTACTTCAT CGGCGGCGGC
GGCTTCGGTC AGGCCAACGG CGGTAGCTCG TCCTCACGGG AGATCTCCAG CTGGGTGACG
GAGAACTTCA CCTCGAAGAC GGTCGACGGC GTGACCCTCT ACGACCTGAC CGCACCGAAG
ACCGGCGCCA CCACCGGCTC CACGACGGGC ACCGGCGTCA CTGCCTGA
 
Protein sequence
MSPILTDPVP LSGGGRAPGH RRRMPWTRDQ PHHVPSPDAP TETGLPVGGS GGLVSAEDVS 
AEDARWVRPA LVTLLVGTGV LYLWGLGASG WANAFYSAAV QAGSVSWKAF FYGSSDAANS
ITVDKPPASL WVMALSVRIF GLSAWSILVP QALMGVATVG LLYQTVRRQF SAGAGLLAGA
VLALTPVAAL MFRFNNPDAL LVLLLVAAAY ATLRAIEQAS TRWLVFAGVL VSFAFLTKLL
QALLVVPVFA LVYLVTAPTS FWRRVRQTLA AGLGLLIPAG LFIAIVELVP ASSRPYIGGS
QHNSLLELTL GYNGLGRLTG NETGSVGGGR GGRMGGLPES IGNGAAGAAP GGGTVIFGPG
GPGGHGGGMW GQAGWTRMFG SEVGGQISWL LPTALILLVA GLWITRRAPR TNRARAAFLL
WGGWLLVTGI VFSQMKGIFH AYYTVALAPA VGALVGMGCA LLWRHRRHPA AAVVSAATMA
ATAAWCYTLL NRTPQWHPWI RYAVLVAGII AALGFLAAIR LPRRAALGVA TVALVAGLLG
PGSYAIATAA TPHTGSIPAA GPAGAGFGGP GGGRFFRMGQ GGLQAGRQRG FPPRGMGRGA
QGGPQGGGIF PGGAFPGDGQ QMPGGQGGMP GQAGGGAGTG TTTLGNGILG GRESRGGGPG
GLLSAVTPSN TIVKLLRQNA DSYTWVAAAV GSNSAAGYQL ATQDPVMPVG GFNGSDPSPT
LAQFKQYVAE GKIHYFIGGG GFGQANGGSS SSREISSWVT ENFTSKTVDG VTLYDLTAPK
TGATTGSTTG TGVTA