Gene Francci3_1104 details


Gene Information

Locus tag: Francci3_1104
Symbol:
ID: 3905775
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1317629
End bp: 1318846
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 74%
IMG OID: 637878436
Product: transposase IS66
Protein accession: YP_480213
Protein GI: 86739813
COG category:
COG ID:
TIGRFAM ID:
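
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1317629
end_bp = 1318846

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1218 bp) and Protein Length (405 aa).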


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.159368
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.482446
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGGTCG TACGGCGGAC GACCCGACGA CGCCGCTACC GGCGGGCCTG CGCCTGCTAC 
GGCCCGGCGA CGGTCATGGC GCCGGGCGAA CCGCGGGCAC TCGGCAAGGG GCTGTGGACG
AACCGGTCGC TCGCGCTGCT GCTCGTCGAG CGCTACGGCG CGGGCCGGTC CCTGAACTCG
CTGGTGGCGG GCCTGGCCCG GCACGGCGCG GTGCTGGCGG CCCCGACCCT GGTGGGTGCG
TGCGCCCAGG CCGGCGTGCT GCTCGCCCCG CTGGTCGAGG CGATCCGCCA ACGGTCCCAG
GCATCGTGGC ACCTGCACGC CGACGAGACG AGCTGGAAGG TGTTCACCCC GAACGGCGGC
GGGAAGCCGC AGCGCTGGTG GCTGTGGGTC TCCCTCGGCG AGGACACGGT GTGCTTCGTG
ATCGACCAGA CCCGGGCCAG TTCGGTGCTC ACCGGCCACC TCGGCCTCAC CGAGCAGGCG
GACGGCACGC TCACCGCCCC CGGCGGCGGC GAGCGGGTAT TGTCCTCCGA CTTCTACGCG
GTCTACGTCT CCGTCGGCCG CCGCCGCGCC GGCCTCGTCA ACCTCTACCG CGTCGCCCAC
CTTCGCCGCT ACTTCCCGCG GGCGAGGCTG TCGAACCCGG TCCAGCTGGA GTACTGGGAG
AAGGCCTGGC TCGACCGGTT CCGCGCGCTC TACACAGCCC ACCGGGAACT GGCCACGGCG
TGGGCGCGAG CGCGCGACAC CCCCGGCCCC GACGCCGACA CCCGGCTGGC CGAGGCCTAC
ACCGCCTGGG ACGGCGCGAT CGAGGCGATC GACACCGCCC GCCGCGAGCA GCAGGCCTCC
CCAGGGCTAC AGCCCGCGGC GAAGGACGCC CTCGCCACCC TCGAACGCGA GTGGGACGGG
GTCGTCGCGC ACCGCGACTA CCCCATGGTA GATCTTGACA ACAACGCCGC GGAGCGCGCC
CCGCGCCGCC CGGTCGTGAC CCGCAAGAAC GCCTACGGCT CCCGCACCGA CGACGCCGCG
GCCCTCGCCG CCGCCGTCTG GACCGTCCTG GGCACCGCCG AGAAGCACGG ACTGAACACG
CTGACCTACC TGACCGCCTA CCTCGACGCC TGCGGCCGAG CCGGCGGCAA ACCCCCACAG
GGAACCGACC TCGACCGGTT CCTGCCCTGG CTGGCCAGCC CCGACGACCT CGCCACGTGG
AAACAGCCAC CCGGCTGA
 
Protein sequence
MRVVRRTTRR RRYRRACACY GPATVMAPGE PRALGKGLWT NRSLALLLVE RYGAGRSLNS 
LVAGLARHGA VLAAPTLVGA CAQAGVLLAP LVEAIRQRSQ ASWHLHADET SWKVFTPNGG
GKPQRWWLWV SLGEDTVCFV IDQTRASSVL TGHLGLTEQA DGTLTAPGGG ERVLSSDFYA
VYVSVGRRRA GLVNLYRVAH LRRYFPRARL SNPVQLEYWE KAWLDRFRAL YTAHRELATA
WARARDTPGP DADTRLAEAY TAWDGAIEAI DTARREQQAS PGLQPAAKDA LATLEREWDG
VVAHRDYPMV DLDNNAAERA PRRPVVTRKN AYGSRTDDAA ALAAAVWTVL GTAEKHGLNT
LTYLTAYLDA CGRAGGKPPQ GTDLDRFLPW LASPDDLATW KQPPG
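
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the behaviour of NCBI translation table 11 (the table named in the record): the same codon-to-amino-acid mapping as the standard code, with alternative start codons such as GTG read as methionine at the first position. The function names `translate_cds` and `gc_fraction` are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in TCAG order; translation
# table 11 uses the same mapping for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted by table 11; each is read as M when it
# is the first codon of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "GTGCGGGTCG TACGGCGGAC GACCCGACGA CGCCGCTACC GGCGGGCCTG CGCCTGCTAC"
last_line = "AAACAGCCAC CCGGCTGA"

print(translate_cds(first_line))  # MRVVRRTTRRRRYRRACACY
print(translate_cds(last_line))   # KQPPG
```

The two printed fragments match the start and end of the listed protein sequence: the initial GTG is read as M, and the closing TGA is a stop codon that contributes no residue. Passing the full gene sequence to `gc_fraction` would give a value to compare against the listed GC content.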