Gene Francci3_3327 details


Gene Information

Locus tag: Francci3_3327
Symbol:
ID: 3904113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3942491
End bp: 3945862
Gene Length: 3372 bp
Protein Length: 1123 aa
Translation table: 11
GC content: 75%
IMG OID: 637880652
Product: peptidoglycan-binding LysM
Protein accession: YP_482413
Protein GI: 86742013
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3170] Tfp pilus assembly protein FimV
TIGRFAM ID:
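
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 3942491
end_bp = 3945862
gene_length_bp = 3372
protein_length_aa = 1123


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_length_aa(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, the span 3942491 to 3945862 gives 3372 bp, and 3372 / 3 = 1124 codons, of which 1123 encode residues.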


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.172022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCAGC TCACCCCCAC CCGCCGCCAT GCCGACCATC GAGATCATCA ATGCCGACTC 
GGCCCCCCAC CCGGCCGGAC GCCGGCACGC CGACGCTCAC GGACGGGGGC GACCGCCGGT
GTGGCCCGCC CGTCGGCCGG GCAGGCGCTG CGCGGCCTGG CCGCGCTTCT CGGCCTGCTG
GTCTTCCTCA CCGCCGTGCC GGTGTTCCTG GCCGCCTGGC GGGGAAACCC GCTGCCCACC
ACCTGGGAAC CCGCCCGGTG GGCGACGCTG GCCACCCGGG GCTACCTCCA CCCCGACGTC
GTCCCCGACA CCCTCGCCGT GCTGGCCTGG CTGACCTGGG CCTACCTCAC CCTGTGCATC
ATCGGGGAAG CCCTCACCCA GACCGCCCGG AGCCGACGCG GTCCACACGG CACGGCCCGC
CCCGGCGGCG TGTCCAACGC GCCGACCGTG CCGGAACGCG CCGGATGGCG CCTTCCCGCG
CCGAACGTGG TCCGGCGCGC CACGGCGCGA TGGGTGGCGC TGGCCTCCCT CGCCGTCAGC
CTGCTCTCCT CCCGGGCCGC CCTCGCCGCC ACCCCGACCA CGACCGGCCC GTCCGGGCAC
CCGGCGGCCA CGGCGAGCGC ATCCGCCGAC GTCACAACCG TCCTCGGTGG CACCGCCCCC
GCCCCGGCTG CAGACGTCCT GGCCGGCCCC GTCCAGGCCA GCCCCGGGCC GGGGCCGATC
GGGCATGGGC TACCGGAACC GAGACGGCTC TTCGGGGACA TCTACCGCTG CGGCCCCTAT
GACACCCTGC GATCCGTCGC CGCGCAGTTC CTCGGCAACC TGGACCGGTG GAAGGAGGTC
CGCGACGCCA GCGTCGGCCT GCGCCAACCC GACGGCACCA CCCTCCCACC CGACTTCGTG
GTGATCGGCG ACGGAACCCT GCTACGGATC CCCCTCGCCC CCGCACCGAC CACAGCATCC
GCGACCCCGA ACGCCACCAC CGGCACAGCG AAACCGGCAC GAGGCACCCC ACCGACCGAG
ACGACCCACA CGGTTCGTCC CGGCGAGACG CTCTGGGACA TCGCCCACAA CGCCTACCCC
GACGTCCCGA AGCACGACCT TCCGCACCTG GTCGGGCAGA TCTTCCACGC CAACCAGGGC
GCGACGGATC CCGCCGGACG ACGCCTCCAC GACCCGAACC TCCTTAACCC CGGCATGATC
CTCCGACTAC CCGTGCTCGC CGGCAGCAAC GGCGCGCCGA CGTCCAGTGC CGCGCCGCCC
AGCGCCGGAT CCGCGGCGCC CCCGGGCGAC GGGCCGTCGA CGTCGGACCC GAACGCGCCC
AGCCCACCGG GGATCGTAAG AGCGACGTCG CTGGCACCCG GCCCCACCCG GCCGTCCCCG
AGCGCACCCG CAGCGACCCG GCACGACGAC CACACCACCG CGCCGAACCA CACCACCGCG
CCGAACCTCC TCCCCGTCTG GGTCGGCGCC GCCGGCCTTC TCACCGCGGC CGTCGCGTCC
ACCGTCACCT CGCGGCGCCG ACGGCGCGAC CAAAGCCTCG ACCGGCGCGG CGCATCGCCC
CCGCCCCACC CGCGCACCAT CGCCCTGCAC ACCGCGGTCC TCGCCACCGC CGACCCCGAC
GGGCTGTCCC GCCTCGACGC CGCCCTCCGC GCACTCGCCG CCCAGCACAT CCCCACCGTG
CACGGGCCCG CGGGTCCCAC CGAAGGTCCC GAACCCCTGG TCGTCCTCGT CCGCCCGGAC
GGCACCATCG ACGTCTACCT CCGCCAGCCC CGTCCCGACC CGCCGGTACC CTGGCACGCG
CAGGTCGACG GCCGGATCTG GATCCTGCCT CCCACCGCCC CGCTACCTCC GCTTCCCGAC
CTCCCGCCGC CCTGCCCGCT TCTCGTCCAA CTCGGCACCG AACCCGACGG CGCCGAGCTC
TACGCCGACC TGGAAGCCCT CGGGATCCTC ACCCTCGACC CGGGCGACAC CGGCACCGAA
GGCCTCCGCG CGCTGGCCCG CGCCCTGCTC GCCACCCTCG CCCTGTCCCC CCTCGCCGCA
ACACCCCGCA TCGAGGCCGT CGGCTTCGAC CCCCTCGGCT TCCTCGACGA CGACCGGATC
GACATCGCCG ACGACCTCCC CACCCTCCTC GACCGGATCA TCCCTGACCT GCAGGCCCTC
CACGACGAAC TCGCCGCCAC CGGCCACACC TCCACCTTCG CCGCCCGCGC GGCCGTCCCC
GTGGAGAACT GGGAACCGAC CGCGGCCCTC GTCGTCCTCC CACACCCCAC AGACCACGAC
CCCCACGCCC ACGCCGACCT GATCGACCTC GCCGGCGGCG GCGGCCAGGG ACTCGCCGTC
GTCACCCACC ACCCGGCCAG TCCGGACCCC GACGAGAATG CCGCCACCTG GCGGCTCATC
TTCGACGGAC CGGCAGAACC CGGCGGGGAA CCTCTGTGGC GACTCGACCC CCTCGGCCTA
CGCGTCCTCC CCGCCCAGAT GGCCGCCGAC GAACTCCGCG ATCTGCTCCT TCTCCTCGAC
GACGCCGACC AACCACCCCA CCCCGTCCCC CTGCCACCCG CCGAACCCGA CCCCGCCCCC
TACACCGGCC CCGACTGGCA GGTGATGATC GGCCTCCTCG GCCCCCTGGC CGTCGTCGCC
CGCGACGGCC GCCGACCCTC ACGCGACCTT GCCCGCGAAC GCACCCTGGA AGTCCTCGCC
TGGCTCGCCA CCCACCGCGG CCGCACCCGC ACCGACCTGG AAGCCGCCCT CTGGCCCACC
GGCGCCCAGA CCCGCTCCAT CAACAACCAG CTCGGCCGCG CCCGCAGCAT CCTCGTCGCC
CTCGTCGGCG AACAGGCCCG GCAATGGCTC CCGACCCGCC GCACCACCAT CACCCTCCAC
CCCGCCGTCG TCACCGACCT CGACCTGCTC CACGCCCACA TCCGCCACGC CGAAGCCCAT
CGTCGCCATC CCGAGGTCGC CATCGCCGCC CTCACCGAGG GACTCGACCT TGTCCGCGGC
ACCCCCGCCC GCCACCCGTG GATCGACGCC GAACTCGGCT CCCAGCTCAC CACCACCGTT
GTCCGCGCCG CCCTCCTCCT CGCCGACCTG CACCTCACCC ACGGCGATAC CGCCGCCGTC
CTCGACGCCA CCCGCCGCGG CCTCGCCGTC CTGCCCGCCC ACCCCGGCCT GTTCGCCCAG
CGCATGCGCG CCTACGCCCA CGCCGGCGAC CGCAGCGCCG TCCGCGCCGA ATACCACTCC
TACCTGCGCG CCGAACAAGC CGACCCCCTC TGGGACGGCG CCACCGACCC CGACCTCGAA
ACCCTCTACC GCACCCTCAC CCACAGCACC CACCGCCGGC CACCGGTCGA TCTCCGAATC
CGAAGATCTT GA
 
Protein sequence
MPQLTPTRRH ADHRDHQCRL GPPPGRTPAR RRSRTGATAG VARPSAGQAL RGLAALLGLL 
VFLTAVPVFL AAWRGNPLPT TWEPARWATL ATRGYLHPDV VPDTLAVLAW LTWAYLTLCI
IGEALTQTAR SRRGPHGTAR PGGVSNAPTV PERAGWRLPA PNVVRRATAR WVALASLAVS
LLSSRAALAA TPTTTGPSGH PAATASASAD VTTVLGGTAP APAADVLAGP VQASPGPGPI
GHGLPEPRRL FGDIYRCGPY DTLRSVAAQF LGNLDRWKEV RDASVGLRQP DGTTLPPDFV
VIGDGTLLRI PLAPAPTTAS ATPNATTGTA KPARGTPPTE TTHTVRPGET LWDIAHNAYP
DVPKHDLPHL VGQIFHANQG ATDPAGRRLH DPNLLNPGMI LRLPVLAGSN GAPTSSAAPP
SAGSAAPPGD GPSTSDPNAP SPPGIVRATS LAPGPTRPSP SAPAATRHDD HTTAPNHTTA
PNLLPVWVGA AGLLTAAVAS TVTSRRRRRD QSLDRRGASP PPHPRTIALH TAVLATADPD
GLSRLDAALR ALAAQHIPTV HGPAGPTEGP EPLVVLVRPD GTIDVYLRQP RPDPPVPWHA
QVDGRIWILP PTAPLPPLPD LPPPCPLLVQ LGTEPDGAEL YADLEALGIL TLDPGDTGTE
GLRALARALL ATLALSPLAA TPRIEAVGFD PLGFLDDDRI DIADDLPTLL DRIIPDLQAL
HDELAATGHT STFAARAAVP VENWEPTAAL VVLPHPTDHD PHAHADLIDL AGGGGQGLAV
VTHHPASPDP DENAATWRLI FDGPAEPGGE PLWRLDPLGL RVLPAQMAAD ELRDLLLLLD
DADQPPHPVP LPPAEPDPAP YTGPDWQVMI GLLGPLAVVA RDGRRPSRDL ARERTLEVLA
WLATHRGRTR TDLEAALWPT GAQTRSINNQ LGRARSILVA LVGEQARQWL PTRRTTITLH
PAVVTDLDLL HAHIRHAEAH RRHPEVAIAA LTEGLDLVRG TPARHPWIDA ELGSQLTTTV
VRAALLLADL HLTHGDTAAV LDATRRGLAV LPAHPGLFAQ RMRAYAHAGD RSAVRAEYHS
YLRAEQADPL WDGATDPDLE TLYRTLTHST HRRPPVDLRI RRS
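
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is an illustration only, not the method used to produce this record. It builds the codon table from the standard amino-acid assignment string; translation table 11 uses the same codon-to-residue assignments as the standard code and differs only in which codons may act as start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codons are enumerated with bases in the order T, C, A, G, and the string
# below gives the residue for each codon in that order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in reading frame 1."""
    dna = clean(dna)
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGCCCCAGC TCACCCCCAC CCGCCGCCAT GCCGACCATC GAGATCATCA ATGCCGACTC"
last_line = "CGAAGATCTT GA"

print(translate(first_line))  # MPQLTPTRRHADHRDHQCRL
print(translate(last_line))   # RRS*
print(round(gc_fraction(first_line), 2))
```

The first translated fragment matches the first 20 residues of the protein sequence, and the last fragment matches its final three residues followed by the stop codon TGA.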