Gene Francci3_4533 details

Gene Information

Locus tag: Francci3_4533
Symbol:
ID: 3907510
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5413250
End bp: 5416006
Gene Length: 2757 bp
Protein Length: 918 aa
Translation table: 11
GC content: 72%
IMG OID: 637881866
Product: integral membrane protein MviN
Protein accession: YP_483608
Protein GI: 86743208
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
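
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_bp // 3 - 1


length = gene_length_bp(5413250, 5416006)
print(length)                     # 2757
print(protein_length_aa(length))  # 918
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed in the record.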


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCCGG TCGAGGAGGA GGGCCGGCCT GGGGCCCGGC ATCGCAGGGG TGCTACCGAC 
CCCCGGCGTG ATCGCGGTAC CGGCGGTGGA CCGTCGGACC GGGACCCACC CGATGTCGGG
TACTGCGACG AGGACGGCTA CCCGGCGGCC GAACGCCCGG CCCGGTTCCC CCCGGACCAG
TCGTATCCCG GTTATCCCTG GCAGACCCGT CCGGAGCAGC CAGATCCGGA CCGGGGACTC
GCGCGCCGGC CCCGGCACTC CGAACGGGAC CATCCGGAAC GGGACCATCC TGTCGACCCG
GACGGACGCC GTGGCGCGCC ACGGTTCGAC GCGCCACGGT TCGACGCGCC ACGCCATGAA
GACCGGCGGC AGGGCGACCC ACGGTACGAC GGCGCGCGCT ATGACGGCCC GCGCCACCCG
GACCCGCGCC ATGACGGCCC GCGCCACCCG GACCCGCGCC ATGACGGCGC GCGCTATGAC
GGCCCGCGCC ACCCGGACCC GCGCCATGAC GGCGCGCGCT ATGACGGCCC GCGCCACCCG
GACCCGCGCC ATGACGATCC CGCGAGCTCC CGCGACGAGC GGTATCCGCG ATCCGCAGAC
CGCCTCCCCG CTCCCGGCGC TCCGCGCATG ACCGCCGGCG AGCAGACCCG GAAGCTTCCC
GCCACGTCGG GCCCACCACC GCCGCCTCCC CATGTGCCGC CTCCGTCGGA AGGAGGTCCC
CCGCCGGGGC CACGCATACC GGAACGATCG CAACGCGGCA CCCCGCCGCT GGACGACGGG
CGGACGAGCC GGCTGCCGAC CCCGCCCGGG GTCGGCCCTG TCATCCCCAT CGGACGATCC
TCATGGCGGG ACACCCCAGA CCGCCCAGAG CCTGCGGCGC GTCCGGAGCG ACGCACCCGA
GCCGACCGTT CCCGTCGGGA CGGCGACGAG CGCCGTCGGC CGTCCTTGAG CGAGTCCAGG
CCGACGGTGG CACTTCCTCG CAGCAACGTC CGCCGCATCC CCCCCGAACC GGCCCGACGA
GACACCGGCA CGGCGCCGCA CGGCAGCACA ACGGCGCGGC GCGACAGCAC AACGGCGCGG
CGCGGCAGCG CAACCATGCA GGCCGGCAGC CCGGCCGAGG CCGGGCCCGG CCTCGGCCGG
GCCAGTGGAA CCATGGCGAT CGGCACGATC GTCTCCCGGG CATCCGGGTT CCTGCGGACC
GTCGCGATCG CCGCGGCGAT CGGGACCGGT GCGGTGAGCC AGGCCTACAA CGTCGCCAAC
ACCACGCCGA ACATCCTCTA CGACCTGCTC CTCGGCGGCG TCCTCACGAG CGTCGTGGTG
CCGGTCATGG TCCGCACCGC GAAGGAGGAT CCCGACGGGG GGGACGCGTT CGCGTCGTCG
CTGCTCACAC TCATGATCCT CGGCCTCGGG GCCGTCGTGG CAGTGGGCAT GCTGATCGCC
CCGTGGATCA TCAGTCTCTA CCTGCACGCC GGCTCCGACG AGCGGGCGCT CGCCGCCACG
ATGCTGCGGT GGTTCCTGCC GCAGATCGTC TTCTACGGGG TAGGCGCGAC GATCGGGGCG
ATCCTCAACG TCAGACAGTC GTTCACCGCG CCCATGTTCG CCCCGATCCT GAACAACCTG
CTGGTCATCG TCACCTGTCT GGGCTTCACC TATTTCATCG CGGGGCCGCG CCCGCCGGGT
GTGGACGGGC CGAAGGCCAT CACCGACACT CAGGTGACGG TGCTCTGCGC CGGGACGACG
CTGGGCGTCG TCGTCATGAC CCTCGCACTG CTCCCCTCGC TGCGCAAGGT CGGTTTCCAC
TACCGGCCGC GGTTGGACAT GCGGCATCCC GAACTGCGCT CGACGGCCCG GCTGGCGGGG
TGGACGCTGC TGTTCGTCGT GATCAGCCAG GCCGGCTATC TGGTGATCGT CAACCTGTCG
ACGGCGACGA CCGCCTACAC GATCTACACG TACGGATACC AGATCTTCCA GCTCCCGTAC
GCCATCATCG GCGTCTCGGT CATCACCGCG CTGTTGCCGA GGATGAGCGG GCACGCCGCC
CAGGGGCGCA GCGACCTGGT GCGGGCCGAC ATCTCGATGG CAACCCGGAT CACGATCACC
GCGATCGTGC CCGCCGCCCT GTTCATCCTC GCCCTGGGCC GGCCGATCGC CGTCGCCGTG
TTCCATCACG GCGCGACCGG CTACGGCGAC GCTGTGGACA TCGGCGACAC CCTCAGCGCC
TTCGCCCTGG CCCTCGTGCC GTTCTCTCTG TTCCAGGTGC AGCTACGGGT GTTCTACGCC
TACCAGGACA GTCGCACCCC GGCCCTGGTC AACATCGGGG TCGTGGCGGT GAACGTCATC
GGTGCCCTGG TCCTCACCGT GGTCGTGCCC GAGGAGCACC GGGCCGTCGC GCTGGCGCTC
TCCTTCGCCA CCGCCTACCT GATGGGCCTC GCTGCGACGA GCGCCCTGTT GCACCGGCGG
CTCGGGGGAA TCGATGGCAA TCGCACCGTG CGGGTCATCA CCCGGGTTGC GATCTCGGCC
GGGATCGGCG CCGTCCTCGC GTCGCTTCTC GCCCGAGGTG TCCGCGCGGT CGTGGGTGAG
GGGTGGCTTG GTTCCGGGAT CGCGGTGGTG CTCGCCGCGG CCGTGGGAGG CGGGCTCTTC
CTCGCGGTCG GCACCCGGAC GGGGATACAC GAGATCAACG CCCTGCTGGC GGAGGTGAAC
GGACGACTCG GCGGCCGGCT GCCCCGACCG CCGGGACGGT CATCGCAACG CGACTGA
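
The gene sequence is printed in blocks of 10 bases separated by spaces, so it needs cleaning before any computation. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names are hypothetical, and the example uses only the first two blocks of the sequence, so its result is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGACGCCGG TCGAGGAGGA ")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.65
```

Passing the full block-formatted gene sequence to the same two functions would give the GC fraction for the entire gene.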
 
Protein sequence
MTPVEEEGRP GARHRRGATD PRRDRGTGGG PSDRDPPDVG YCDEDGYPAA ERPARFPPDQ 
SYPGYPWQTR PEQPDPDRGL ARRPRHSERD HPERDHPVDP DGRRGAPRFD APRFDAPRHE
DRRQGDPRYD GARYDGPRHP DPRHDGPRHP DPRHDGARYD GPRHPDPRHD GARYDGPRHP
DPRHDDPASS RDERYPRSAD RLPAPGAPRM TAGEQTRKLP ATSGPPPPPP HVPPPSEGGP
PPGPRIPERS QRGTPPLDDG RTSRLPTPPG VGPVIPIGRS SWRDTPDRPE PAARPERRTR
ADRSRRDGDE RRRPSLSESR PTVALPRSNV RRIPPEPARR DTGTAPHGST TARRDSTTAR
RGSATMQAGS PAEAGPGLGR ASGTMAIGTI VSRASGFLRT VAIAAAIGTG AVSQAYNVAN
TTPNILYDLL LGGVLTSVVV PVMVRTAKED PDGGDAFASS LLTLMILGLG AVVAVGMLIA
PWIISLYLHA GSDERALAAT MLRWFLPQIV FYGVGATIGA ILNVRQSFTA PMFAPILNNL
LVIVTCLGFT YFIAGPRPPG VDGPKAITDT QVTVLCAGTT LGVVVMTLAL LPSLRKVGFH
YRPRLDMRHP ELRSTARLAG WTLLFVVISQ AGYLVIVNLS TATTAYTIYT YGYQIFQLPY
AIIGVSVITA LLPRMSGHAA QGRSDLVRAD ISMATRITIT AIVPAALFIL ALGRPIAVAV
FHHGATGYGD AVDIGDTLSA FALALVPFSL FQVQLRVFYA YQDSRTPALV NIGVVAVNVI
GALVLTVVVP EEHRAVALAL SFATAYLMGL AATSALLHRR LGGIDGNRTV RVITRVAISA
GIGAVLASLL ARGVRAVVGE GWLGSGIAVV LAAAVGGGLF LAVGTRTGIH EINALLAEVN
GRLGGRLPRP PGRSSQRD