Gene Francci3_4074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4074 
Symbol 
ID3907037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4878337 
End bp4881006 
Gene Length2670 bp 
Protein Length889 aa 
Translation table11 
GC content72% 
IMG OID637881401 
Productmembrane protein-like 
Protein accessionYP_483151 
Protein GI86742751 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGACG GATGGCTGGG ACGGCTCACC AGAGCGTTCC CCGACGACCC CGGGCTGATC 
AACCTCAAGG CGGCCGTCCG GGTTGCCTTC ATCGCCCCCA CGCTGCTGGC CGTCACCTAT
CTGGCTGGTG GGGACGTGCG GCTGAGCCTG TTCGCCTGGT TCGGCGCCTA CACCCTGCTG
GAGTTCATCG ACTTCACCGG GCCGACCAAG ACCAGGCTGT TGGCCTACGT CGCGTTCGTG
CTTATCACCG TCTCGCTCAT CGTCATCGGC GCGTTGTGCT CCCGGACACC TTGGCTGGCG
GCACCGGTGA CCGCGCTGGT CGCCCTGGTC GTGCTGTTTT CCGGGGTCCT CAACGCGTAT
TTCGCCGCCG CCGGACGGGC CACGCTGATG GCCTTCGTCC TGTCGGTGAT GACGCCTGGC
CCGGTGTCCG CCATTCCGGA ACGGCTCGCC GGCTGGGGGG CCGCCATGGT GGTGGCCGTC
ACCGCGGCCA TGGTGCTGTG GACGGAGCGG CCCCCAACAC GGCTGCGTGC CGCGCTGGCC
CAGGCGTGTC GGTCGATGGC CGCCGGGGTG GCGTGGACGA CCATGCCTTT TCCGCAGTCT
GACGGGACCG GGGCGGACGG GACCGGGGCG GACGAAGTTC CCCGGATCTG GACCGACGTG
TTCCACCTGC GCCGCAGGTT CGCCGAGACC GCGCACCGGC CCAGCGGTGT CGGCGGGCGC
ACCGCGGCGC TGGGGTATCT CGTCGTGGAC GTCAACTGGC TGGTCCCGTT CGCCGAACCG
CGTGCGGACC GGAATCGGAT CGCGGCGGCA TGCTTCCCCC GCGAAGCTGC CGAGATCCAC
ACCGCCGCCG CTGCCACGCT CGCCGCCGCG GCCGATCGAC TCGACCGCGG TCCACGGCCG
AGCGAGAGGC TCGGCCTGGA CCGTTTGGAG CGGGCTGAGC GGAGGATGCG CACGGCCCTG
CTGGACTACC TGGAGGGGCA AGTCAGCCGG CCGGCGCCGG ACGCTGACGG GTGGAGCGCC
CGATCGGCCC AGAAGATGGA TGATCTCGTG GTCGCCGAGT CGGAGCAGCC GCCGGCGGAC
CTCGCGATCG CCGAGGCGTT CCGGCTGCGC AGGCTGGCCC GGGGGACCCG GGAGCTGGCG
ACGAACGTGC TGCAGGTGAC CGCGCCGGCA CCGGCGCTGC GCTCCTGGGT GCGGCCGCGT
GGCTGGCCCG CCAGGCTAAG GCGGCTCACC CGGCGAGGCG CCGCCGCCAC CGATCTGGCC
GCCGGATATG CCAGCATCCG GTCGGTGTGG TTCCGCAACA GCGTCCGCGG CGCCGTGGGG
CTCGCCCTCG CTGTCGCCCT GGCCCAGAAC ACCGAGGTGC AGCACGGCTT CTGGGTCGTG
CTGGGTACCC TGTCGGTGCT GCGGTCGAAC GTGGTGGCGA CCAGCTCGAC GATCCTGCGG
GACCTGCTGG GCACCGGGAT CGGCATTGTG GTGGGTGGCC TGTTCGTCGC CCTGATCGGT
ACCCACACCG TAGTATCGTG GCCGCTGCTG CCGCTGGCTG TGTTCGTCGC CGGCTACATG
CGACGGAAGT CGTTCTTCGC GCTGGGCCAG GCAGGCTTCA CCGTTGCCAT CCTCATCCTT
TTCAACATCG TTGAACCATT AGGCTGGCGG GTGGGGCTGG TCCGCATTCA GGACGTTATG
ATCGGCTTCG GGGTTAGCCT GGCCGTGGGA GCGCTGCTGT GGCCGCGAGG AGCGGCGGCG
GTAATCCGGC ACCGTGCCGC CGCGGCCTAT CGGATGGGCG CGGCTTTCCT CGCCCTGGTC
GTCATGCGGG CACCCGGTGA TGACGATCCG CCCAGCGCCC GGCCGCGAAC CTCGCCCTGG
CTGGACCCGG CCTATACCGA CATGTCGGGC CATCCGGACG GCCCCGCGGA CACCATCGGT
GTGCTCGTGA GCCGGGTCGG CGCCTCCCGA ACAGCCGGCG CATCCCAAAC AGCCGGCGCA
TCCCGAACAG AGGACGCCGA CACCGCGGGC TCGGACGATG CGGCCCTCCG CCGTCCCAGC
CGGGACCAGC AGGTGGCCGC CGCGGCCCGC GATGCCATCC GCGCCGGGCG GCTGCTCGAC
GACGCGGTCC GTCAGTACCT GTCGGAGCAG CCTGACGGCC GGCTCGACGT CGACGCGCTC
ATGACCGTCG TCGGCGGGGC GCTGCGGCTG CGCCGGACCG CGCAGCTGCT GCACGCCGGC
GACGTGCCCT GGCCGGCCGA CATCCTGCGG GCCGGGGGCG GCGTCAACAT GATCGCCTCG
ACCATAGAGA TCTCCGCCGC GCTGCCCGAT CTGGCCGCAG CCCAGGAAGC GGTCACGGTG
GAGACGACCG AGCTGTGCGA CTGGTATCTG CGGTTCGCCG AGGCACTCGG CGGCGGCAAG
CCTCCGCCGC CGGCAGACCC GAACCCCACT CCTGGCCCGG CCAGCATGGC GGCGCTGGCC
GTGGTGCGCC ATGGCGCCGC CGCGCACCGA CGCCCGGAAC TACGCGCCGG CGTGGCGCTC
GCAGCACGCG CGACCTATCT CGACATCCTG CGTGGTCTGC AACCGATGCT CACTGACGCG
GGCCTGGCGC TCGTTCACCA GGGCCCCGCT GGACGCCGCG GCTGGGCGGC CGACGTACGA
CCCCGGCGGA CTCGCCGGAT GCCACGCTGA
 
Protein sequence
MQDGWLGRLT RAFPDDPGLI NLKAAVRVAF IAPTLLAVTY LAGGDVRLSL FAWFGAYTLL 
EFIDFTGPTK TRLLAYVAFV LITVSLIVIG ALCSRTPWLA APVTALVALV VLFSGVLNAY
FAAAGRATLM AFVLSVMTPG PVSAIPERLA GWGAAMVVAV TAAMVLWTER PPTRLRAALA
QACRSMAAGV AWTTMPFPQS DGTGADGTGA DEVPRIWTDV FHLRRRFAET AHRPSGVGGR
TAALGYLVVD VNWLVPFAEP RADRNRIAAA CFPREAAEIH TAAAATLAAA ADRLDRGPRP
SERLGLDRLE RAERRMRTAL LDYLEGQVSR PAPDADGWSA RSAQKMDDLV VAESEQPPAD
LAIAEAFRLR RLARGTRELA TNVLQVTAPA PALRSWVRPR GWPARLRRLT RRGAAATDLA
AGYASIRSVW FRNSVRGAVG LALAVALAQN TEVQHGFWVV LGTLSVLRSN VVATSSTILR
DLLGTGIGIV VGGLFVALIG THTVVSWPLL PLAVFVAGYM RRKSFFALGQ AGFTVAILIL
FNIVEPLGWR VGLVRIQDVM IGFGVSLAVG ALLWPRGAAA VIRHRAAAAY RMGAAFLALV
VMRAPGDDDP PSARPRTSPW LDPAYTDMSG HPDGPADTIG VLVSRVGASR TAGASQTAGA
SRTEDADTAG SDDAALRRPS RDQQVAAAAR DAIRAGRLLD DAVRQYLSEQ PDGRLDVDAL
MTVVGGALRL RRTAQLLHAG DVPWPADILR AGGGVNMIAS TIEISAALPD LAAAQEAVTV
ETTELCDWYL RFAEALGGGK PPPPADPNPT PGPASMAALA VVRHGAAAHR RPELRAGVAL
AARATYLDIL RGLQPMLTDA GLALVHQGPA GRRGWAADVR PRRTRRMPR