Gene Francci3_2963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2963 
Symbol 
ID3903778 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3510249 
End bp3511448 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content70% 
IMG OID637880284 
Productputative transposase, IS891/IS1136/IS1341 
Protein accessionYP_482050 
Protein GI86741650 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.268101 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.616422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGACGGT CGTACAAGTT CCTGCTCCGC CCGACCGCGC ACCAGCAGAT CGCGTTGACG 
GCGATGCTCG ACGACCACCG GGCGCTGTAC AACGCCGCGC TTCAGGAGCG GCGCGATGCC
TACCGGCATC CGAGCAAGAC GACGGTCCGC TACGGGGGCC AGTCGGCCCA GCTCAAGGAC
ATTCGGGGGT TTGACGCGGA TCAGGCCCGC TGGTCGTTCT CCTCTCAGCA GGCCACCCTG
CGTCGCCTCA ATCTCGCATT CGCCGCGTTC TTCCGCCGGG TCAAGGCGGG CGAGACGCCC
GGCTACCCGC GGTTCAAGGG CGCCGGCTGG TTCGACACGG TGACGTGGCC CGCCAACGGT
GACGGGTGCC TCTGGGACTC GCAGCCCGAG AACCCGAAGA CGATGTTCGT CCGCCTTCAG
GGCGTCGGCC ACGTCAAGGT CCACCAGCAC CGGCCAGTGG CTGGCCGGGT CAAGACGCTG
TCCGTCAGGC GGGAGGGTGC CCGCTGGTAC CTCGTGCTGT CCTGCGACGA CGTGCCCGCC
GAGCCGCTGG AACCAACGGG CGCCGTCGTC GGCGTGGACT TGGGTGTCGC CTCGCTGGCC
AGCACGTCGA ACGGCGAGCA CTACGGCAAC CCGCGTTTCC TCGAACGGGC CGCCGGGCGC
CTCGCCAACG CGCAGCGCGA CCTTGCCCGT AAGAAGCGCG GCTCGAAGCG GCGCCGCAAG
GCGGCTACTC GGGTCGCGAA CCAGTCCCGG GCCGTGGCCC GGCAGCGCGT CGACCTCGCG
AACAAGACCG CGCGCGAACT GGTCGCCGAC CACGACCTGA TCGCTGTCGA GAAGCTGAAC
GTCAAGGGCA TGGTCCGCCG GGCCAAGCCG AAGCCCGACC CGGACCAGCC GGGGGCGTTC
CTGCCGAACG GGCAGGCCGC CAAGTCCGGG CTGAACAGGT CGATTCTCGA CGCCGGGTGG
GGGGTGTTCC TCAACGCACT GCGTGCCAAG GCTGAAAGCG CCGGGCGGGT CGTCGTCGAG
GTCAACCCCC GCCACACCTC CCAGCGATGC GCCGAATGTG GCCATGTCGC CCCGGAGAAC
CGGCCCAGTC AGGCCACGTT CCGCTGTGTG GAGTGCGGCC ACGCCGCGCA CGCAGACGTG
AACGCGGCGA TCAACATACT CGGGGCGGGG CTCGCCCTTC AGGTGGCGCA AGCTTCCTGA
 
Protein sequence
MRRSYKFLLR PTAHQQIALT AMLDDHRALY NAALQERRDA YRHPSKTTVR YGGQSAQLKD 
IRGFDADQAR WSFSSQQATL RRLNLAFAAF FRRVKAGETP GYPRFKGAGW FDTVTWPANG
DGCLWDSQPE NPKTMFVRLQ GVGHVKVHQH RPVAGRVKTL SVRREGARWY LVLSCDDVPA
EPLEPTGAVV GVDLGVASLA STSNGEHYGN PRFLERAAGR LANAQRDLAR KKRGSKRRRK
AATRVANQSR AVARQRVDLA NKTARELVAD HDLIAVEKLN VKGMVRRAKP KPDPDQPGAF
LPNGQAAKSG LNRSILDAGW GVFLNALRAK AESAGRVVVE VNPRHTSQRC AECGHVAPEN
RPSQATFRCV ECGHAAHADV NAAINILGAG LALQVAQAS