Gene Francci3_1189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1189 
Symbol 
ID3903463 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1421529 
End bp1422830 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content71% 
IMG OID637878521 
Producttransposase, IS4 
Protein accessionYP_480297 
Protein GI86739897 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.944159 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGGTCC AGTCTCGCAT GCCCGTCACT CCCACCGATC TCGGACAGGC CGGGTCGGGG 
CAGCTGGTCC GGATGCGGCG GTCGCTGCGG GTCCTGGGGG CGCACGGCGG TGAGGTGCAG
GGGCTCGCCG ACGTACTCGC CGGGGTGCCT GACCCGCGGG ACCCGCGAGG GATACGTCAC
CGGCTCCCGG TGATCCTGGG ACTGTCCGCC GCAGCGGTCG CCGCGGGGGA GAAGTCGGTG
GAGGAGATCG CGGCCTGGGC TGCGCACGCC CCGACGCAGG TCCTGACCGC TCTCGGGGCG
CGGGTCCATC CGGTGACCGG GCAGCCGCAG GCACCGTCGG TGGACACGAT GATCCGGGTC
CTGTCCGCGG TGGACAGCTC GGCGCTGGCG AGGGCGGTCG GGATGTTCGC CGCGGCCCGC
GCCCGCCAGG CCCGTGGTGG TGGGCGGCGG GTGGTCGCGG TCGACGGGAA GACCCTGCGT
GGCGCGGCTG GGCCTGAGGG GCGGGCACCG CACCTGCTCG CGGTCGCCGA ACACGGCACG
GGTGTGGTGC TCGCCGAGCA TGAGGTCGGC GCGAAGACGA ACGAGGTCAC CGCGTTCGCA
CCGCTGCTCC GCGAACTGCA TTCCCATGAT CCGCTGGATG GGGTGGTGGT AACCGCTGAT
GCGTTGCACA CGACCCGCGC CCACGCCGAC CTGATCGTCA CCGAGCTGGG AGCGCACTTC
GTGTTCACGG TGAAGGCGAA CACCCCGGCG TTGTCGGTCG ACTGCCACCA GGCGACCGAC
TGGACGAAGA TCCCGATCGG GCACAGCGCC GAGGGCAGGG CCCATGGACG GTTCGAACGA
CGCACCATCC AGCTGGCCCA GGCCAGCGAG GCGATCCGTG CCCGCTATCC CCACGCCCGC
ACCGTGGCGC GGATCCGCCG TCATGTCCGG CGGACCGTGA CCACCGGCAC GGGCCGGGCC
CGGGTCACCC GGACGATCCC GAGCACTGTC ACGGTCCACG TCCTGACGAG CCTCACCCTC
GACGCGGTCA CACCCGCTGA TCTCGCGGGC TACGCCCGAG GGCATTGGAC GATCGAGAAC
AAGGTCCACT GGGTGCGCGA TGTGACGTTC CGTGAGGATG CCTCGCGGGT TCGGACCGGC
CCACTGCCCC GCATCATGAC CACACTCCGT AACCTGATCA TCGGGCTGAT TCGCCTCGCT
GGCCATAACC GCATCGCCCC GACCATCCGC AGAATCCGAC ACGACAACGC CCTGCTCCTG
GCCATCCTCA CTCTCGACAA CCCCGCTGAC CTGCATCAAT GA
 
Protein sequence
MPVQSRMPVT PTDLGQAGSG QLVRMRRSLR VLGAHGGEVQ GLADVLAGVP DPRDPRGIRH 
RLPVILGLSA AAVAAGEKSV EEIAAWAAHA PTQVLTALGA RVHPVTGQPQ APSVDTMIRV
LSAVDSSALA RAVGMFAAAR ARQARGGGRR VVAVDGKTLR GAAGPEGRAP HLLAVAEHGT
GVVLAEHEVG AKTNEVTAFA PLLRELHSHD PLDGVVVTAD ALHTTRAHAD LIVTELGAHF
VFTVKANTPA LSVDCHQATD WTKIPIGHSA EGRAHGRFER RTIQLAQASE AIRARYPHAR
TVARIRRHVR RTVTTGTGRA RVTRTIPSTV TVHVLTSLTL DAVTPADLAG YARGHWTIEN
KVHWVRDVTF REDASRVRTG PLPRIMTTLR NLIIGLIRLA GHNRIAPTIR RIRHDNALLL
AILTLDNPAD LHQ