Gene Francci3_2675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2675 
Symbol 
ID3904899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3159206 
End bp3161134 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content73% 
IMG OID637880000 
Producthypothetical protein 
Protein accessionYP_481766 
Protein GI86741366 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.196896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.830028 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCACGG CGTTCCACCG GCGGGTCGGC GCGGTCCGGC CCAGCCACCT GATGTTCACC 
AGCGGTGTCG GCGCGCTCGT CGACCTGCCG AACTTCTCCG CGATCGTCCG CGGCGTTGAC
GACTGGGACT ACCGCAACGT CCCCGAGTAC CAGCCGATCG TCGAGCCGCG GTTGCTCGCC
GCCGTGCGGC GGCTGTACGA CCGGGGCGTC ACCGAGCTGC GCCCGGCCCC ATGGGCGGAG
ACCGATGCGG GCGACCGCAG CGGGCAGGGC GCCCGCATCG GCGTGCCCAC CGTGCCGTTC
CCGAACTGGC TGCGCTGCAC CGCCTGCAAC GAGCTGGACG TGATCGCCTC CCGCGCGTTC
GGGTTCGAGA ACCCGTGGCT GACCCGGCCC CAGGACGCCC GCTTCTTCCA CGGCACCTGC
GGGCGGAAGA AGACCGGCCG CCGGCCGCTC GCGGTGGCGG CCCGTTTCGT GCTCGCCTGC
ACCGCGGGTC ATCTCGACGA CTTCCCCTAC CCGATGTTCG TGCACCGGGG CGCGCAGTGC
GCCAGGGCGA GTCATCCCCG GCTGCGGATG GAGGACCGTG GCGGCAACCT CGGCGCCAAC
GTCGAGATCC GGTGCGTGAA CTGCGACGCC CGGCGCAACA TCCGGGAGGC CATGGGTCGT
CGCGGTGCGG GGCACCTGCC ACGCTGCCGG GGCCGCCATC CCCACCTGGG GACCTTCGCC
GATGGCGGGT GCCCGCAGGA GTCCCGCCTG CTCGTCGTCG GCGCCTCCAA CCAGTGGTTC
GCCCAGACGC TGTCGGTGCT GTCGGTGCCG CGGACTGGCG CCAGCGAGCT GGCGGTCAAG
GTCGAACAGC ACTGGGAGGT CTTCGAAAAG ATCAACAATA TGGCGATTCT GGACTTTGCC
CGCTCAGCGC CGATCCCGGT GTTCCGTCAG CTGGAGAAGT GGTCGAACGC CGAGATCTGG
GCCGCGGTCG AGCGGCATCG CGCCGCGATC GAAACCGGCC CGGCCCCGGG TTCCGACGCC
GACGGCGAGC GGGCCTACCC GGATCTGCGC ACCCCGGAAT GGGAGATCTT CTCCTCCGCC
GTCCTCCCGG AACCCACCGA CGATTTCGCC CTGCGCCGCG AGCCCGGCGG TGTGCCGGCA
CCGCTTGCCG GCTGCTACGC CGACGTCGTG CAGGCCGAAC GCCTGCGGGA GGTACGGGCG
CTGACCGGGT TCACCCGGCT GGACGCGCCC GACCCGGACG ACCCCGACCT CGTCATCCGG
GCGCCGCTGG CCCGTGGGCC GGCTTCCTGG GTACCGGCGA GCGAGGTCCG CGGCGAGGGC
ATGTTCCTGC GGGTGCCCGA GGAGATCCTC GGCCCGTGGG CGCGCCGCGT CGCGGACAGT
GTCGCGATGC GCGAGCACCG CGACGCCTAC AGCCGATTCC GGGTGAACCG CTACTCCGAC
CGGATCGCCG GGGACTTCGA CCCCATGCAG GGCTGGCCCG GGGAGCGCTT CCTCGCCCTG
CACACCCTGT CCCACCTGCT TATCCGGACT ATCGCGCTGG AGTGCGGCTA CAGCTCGGCC
AGCCTGAGCG AACGCATCTA CGCCGGGGAC GACGGCGACC CGCGCGGCGG CATCCTCATC
TACACGGCCG TCCCGGACGC GGAGGGCACC CTGGGCGGGC TGGTGTCGCT GGCCGAGCCC
GACCAGCTGC TCCGGCTGAC CCGCCGGGCC CTCGCCGACG CCCGGCACTG CTCGTCCGAT
CCGCTGTGCG CCGAGCGGCT GCCGTCGGCG CCCGCCGACT TCCTGCACGG AGCCGCCTGC
CACGTCTGCC TGTTCGTGTC CGAGACGACC TGCGAGCGTG GCAACCGCTT CCTCGACCGC
CGGTTCGTCG TGCCCCTCGG AGACGAGCCG CAGGACCGCG CCCTCGCCCT GTTCGGCGGG
CTGCCGTGA
 
Protein sequence
MVTAFHRRVG AVRPSHLMFT SGVGALVDLP NFSAIVRGVD DWDYRNVPEY QPIVEPRLLA 
AVRRLYDRGV TELRPAPWAE TDAGDRSGQG ARIGVPTVPF PNWLRCTACN ELDVIASRAF
GFENPWLTRP QDARFFHGTC GRKKTGRRPL AVAARFVLAC TAGHLDDFPY PMFVHRGAQC
ARASHPRLRM EDRGGNLGAN VEIRCVNCDA RRNIREAMGR RGAGHLPRCR GRHPHLGTFA
DGGCPQESRL LVVGASNQWF AQTLSVLSVP RTGASELAVK VEQHWEVFEK INNMAILDFA
RSAPIPVFRQ LEKWSNAEIW AAVERHRAAI ETGPAPGSDA DGERAYPDLR TPEWEIFSSA
VLPEPTDDFA LRREPGGVPA PLAGCYADVV QAERLREVRA LTGFTRLDAP DPDDPDLVIR
APLARGPASW VPASEVRGEG MFLRVPEEIL GPWARRVADS VAMREHRDAY SRFRVNRYSD
RIAGDFDPMQ GWPGERFLAL HTLSHLLIRT IALECGYSSA SLSERIYAGD DGDPRGGILI
YTAVPDAEGT LGGLVSLAEP DQLLRLTRRA LADARHCSSD PLCAERLPSA PADFLHGAAC
HVCLFVSETT CERGNRFLDR RFVVPLGDEP QDRALALFGG LP