Gene Francci3_1694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1694 
Symbol 
ID3903271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2032640 
End bp2035138 
Gene Length2499 bp 
Protein Length832 aa 
Translation table11 
GC content74% 
IMG OID637879032 
ProductType IV secretory pathway VirD4 components-like 
Protein accessionYP_480799 
Protein GI86740399 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000925009 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0597569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCTGC TCGCCGCGGC GACCACCACG CCGTCGTCGC CGCTCACGAT CTATCTGACG 
GACCCGTGGG GCTTCCTCCA CCAGATCTTC GGCCAGCTAC GGGGATGGGT GGCGGTGTGG
GGCCCCATCG CCGGCCCGCT GCTCACCCTC ACCGCCGCCG GCCTGGTTAC CTTGCGCCGG
CGGCTGCGCC GCCGGTACCA GCAGCAGCTC GCCGCAGGCG CCCGCCTCGT GACCGTGCTG
GCCCCGCCCA CCGTCGACCC GGCGGGCGCG GGCGCGCTGT GGGCGAACCT GCTCGGCCTG
CTCCGGCCCG GCTGGCGGCG CCTGATCGGC CAGCCGCACC TCGTGTGGGA GTACCAGTTC
ACCGCCGACG GGGTGCGCAT CCAGATGTGG GTGCCCGGCG TCGTGCCCGA CGGCTTCGTC
GAACGCGCGG TCGAGGCGGC CTGGCCCGGA GCGCATACCC ACACCACGCC CGCCCGCGCA
CCATTGCCCG TCGTGGCCCG GCCAGGCCGG CGGCTACTCG CCGCCGGCGG CGAACTGCGC
CTCGCCCGCC CCGAAGCACT CCCGATCCGG GTCGACCACG ACGCCGACCC GATCCGCGCC
CTGCTCGGCG CGCCCGGCAG CCTCGCCCGC AACCAACGGG CGGCGGTCCA GATCCTGGCC
CGGCCGGTCA CCGGCCGCCG CGTCGCCCGG TCCCGCCGGG CCGCCCGCCG GCTGCGCGCC
GGCGGCTCCG CGCACCTGAT CGGCGGGCTG CTGGACCTGC TCACCCCCCG CACTGGCCGC
ACCCGGCAGC GCCGCCGGAC GGCTACCACC CCGGTGAAGG TCGACCCGCA GACGTCGCTG
GCCCTGTCCG CGGAAGACCG CGCCATCGTC ACGAAACAGC GCGGGGCCCA GTACGAGGTC
CGTGTCCGCT ACGCCATCGC CGCGATCCTC GACGATCACA CCGACGACAC CACCGCCGCC
CAGGTCGCTA GCCAGCTGCG CGGACGGGCG CACGCGATCG CGAGCGCCTA CGCGGCCTAC
GGCGACCACA ACTACTACCG CCGCGTGCGG CTGCGCCGTC CCCTGCCCGT CCTCGCTACC
CGGCAGTTCG GCCGTGGCGA CCTGCTCTCG GTCGCCGAGC TCGGCGCGCT GGCGCACCTG
CCGGTCGACG AGGCGACCCC GGGCCTGCAA CGCGCCGGAG CGAAGGCGGT CGCCCCGCCG
CCTGGCGTCG CCGGGCCCGG CCCGAATGTG CGCCCGCTCG GGCGGACCGA TGCCGGGCGT
GCGCGTCCGG TCGGTCTGCG GGTCCCGGAC GCCCGGCATC ACCTGCACGT CCTCGGCGCG
ACCGGCGCCG GCAAGTCCGA ACTGCTCGCC CGCATGACGC TCGACGACGT CGCCGCCCGT
CGAGGGGTGG TCAACGTCGA CCCGAAGGGC GACCAGATCA TCGACATCCT CGCCCGCTAC
CCGACCGACG CCCTCGACCG CCTCGTCCTG TTCGACGCCG AATCGAACAG CCGGCCACCC
TGCCTCAACC CGCTCGACCA GCCCGACCGG GGCCGGGCCG TCGACAACCT CGTCTCGATC
TTCTCCCGGG TCTACGCCGA CAGTTGGGGG CCGCGCACCG AGGACATCTT CCGCGCCGGC
CTGCTCACCC TCGCCGCACA ACCCGGCGTC CCGGTGCTCA CGGATCTACC GAAACTCCTC
ACCGACACCG CCTACCGGCA CCGGGCATTG GGCGAGATCA ACGACGACAT CCTCGCCGGC
TTCTGGACCT GGTACGAATC CATCTCCGAC GCCGCACGCG GCCACGCCGT CGCCCCACTG
ATGAACAAAC TGCGCGGCTT TCTGCTGCGG CCGTTCGTGC GGGCCGCGAT CGCCGCCGGA
CCCTCGACCG TCGACATGGA CACCGTGCTG AACGACGGCG GGGTGTGCCT GGTCCGCATC
GCCCAGGACG CCCTCGGCGT CGAAACCGCT GCCCTGATGG GCTCGATCGT CGTCTCCGCC
GTCTGGCAGG CCACCACCCG CCGCGCCCGC CTCCCCCAAG GGAAAAGGCC CGACGCCAGC
CTGTACTTGG ACGAGGCGCA CCATTTCCTC ACGCTTCCGT ACGCGTTGGA GGACATGCTC
GCCGCCGCCC GCGGCTACCG GCTGTCCCTC ACGTTGGCGC ACCAGAACCT CACCCAGCTG
CCTCGGCACC TGGAAGAAAG CATCGGCGCC AACGCCCGCT CCAAGATCTA TTTTACGGTC
AGCCCGGCGG ACGCGAAGCG GCTCGCCCGG CACACCGAGC CCCGGCTCGC CGAACACGAC
CTCGCCAACC TCGGCGTGTT CCATGCCGCC GCCCGCCTGG TCGTCGGCGG CGAAGAAGCC
CCCGCCTTCA CGGTCGTGAC CGAAAAGCTG CCGCCGCCGG TTCCCGGCCG CGCCGCCCAG
ATCCGCCGAG ACCTGCGCCG CCGCGCCGCC ACCCCCGCCG CACCTTCCCC TGCCGGTCCG
GGTCCGCGCC CGACCGCCGA CCCGCGCCGC GTCGCCTGA
 
Protein sequence
MDLLAAATTT PSSPLTIYLT DPWGFLHQIF GQLRGWVAVW GPIAGPLLTL TAAGLVTLRR 
RLRRRYQQQL AAGARLVTVL APPTVDPAGA GALWANLLGL LRPGWRRLIG QPHLVWEYQF
TADGVRIQMW VPGVVPDGFV ERAVEAAWPG AHTHTTPARA PLPVVARPGR RLLAAGGELR
LARPEALPIR VDHDADPIRA LLGAPGSLAR NQRAAVQILA RPVTGRRVAR SRRAARRLRA
GGSAHLIGGL LDLLTPRTGR TRQRRRTATT PVKVDPQTSL ALSAEDRAIV TKQRGAQYEV
RVRYAIAAIL DDHTDDTTAA QVASQLRGRA HAIASAYAAY GDHNYYRRVR LRRPLPVLAT
RQFGRGDLLS VAELGALAHL PVDEATPGLQ RAGAKAVAPP PGVAGPGPNV RPLGRTDAGR
ARPVGLRVPD ARHHLHVLGA TGAGKSELLA RMTLDDVAAR RGVVNVDPKG DQIIDILARY
PTDALDRLVL FDAESNSRPP CLNPLDQPDR GRAVDNLVSI FSRVYADSWG PRTEDIFRAG
LLTLAAQPGV PVLTDLPKLL TDTAYRHRAL GEINDDILAG FWTWYESISD AARGHAVAPL
MNKLRGFLLR PFVRAAIAAG PSTVDMDTVL NDGGVCLVRI AQDALGVETA ALMGSIVVSA
VWQATTRRAR LPQGKRPDAS LYLDEAHHFL TLPYALEDML AAARGYRLSL TLAHQNLTQL
PRHLEESIGA NARSKIYFTV SPADAKRLAR HTEPRLAEHD LANLGVFHAA ARLVVGGEEA
PAFTVVTEKL PPPVPGRAAQ IRRDLRRRAA TPAAPSPAGP GPRPTADPRR VA