Gene Francci3_1959 details


Gene Information

Locus tag: Francci3_1959
Symbol:
ID: 3904321
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2300823
End bp: 2302085
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 69%
IMG OID: 637879296
Product: transposase IS116/IS110/IS902
Protein accession: YP_481063
Protein GI: 86740663
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
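
The coordinate and length fields above are consistent with one another, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 2300823
END_BP = 2302085


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 1263 420
```

Both results match the Gene Length and Protein Length fields of the record.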


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.945198
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGCGGC AGGTGGCGCG GTGCGCCGGT CTGGACGTGC ACAAGGACGA GATCGTCGCC 
TGTGCGCGGA TCTCCGATCC GGGTGGGCCT GGCCGGGTCG AGTTGCACAC GTTCGGGACG
ACGACGCGTG AGCTGTTGGC GTTGCGGGAC TGGCTGACGG GGCTGGGCGT GACCCGGGTC
GGGATGGAGT CGACCGGGGT TTTCTGGAAG GCGCCGTTCC ACATTCTGGA GGACGCGGTC
GCCGAGTGCT GGCTGCTCAA TGCCCGGCAT CTGCGCAACG TGCCGGGCCG TAAGACGGAT
GCGGCGGACG CGGCGTGGAT CGCGGAGCTG GTCGAGTACG GCCTGGTCCG CCCGTCGTTC
GTGCCGCCCC AGCCGATCCG GGAACTACGT GACCTGACCC GGTATCGCCG CGCGCAGATC
GACGAGCGGA CCCGGGAGGC GCAGCGCCTC GACAAGGTCC TCCAGGACGC GGGGATCAAG
CTGTCGTCGG TCGCGTCGGA TGTGCTCGGG AAGTCCGGGC GGGCGATCCT CGACGCGCTG
GTCGCGGGGA CCACCGACCC GGTCGTGCTC GCCGAGCTCG CCAAGGGCCA GCTCCGTAAG
AAGATCCCGG CGCTGCAAGA GGCGCTGACC GCGTTCTTCA CCGGCCATCA CGCGATCATC
ATCGGGGAGA TCCTGTCCAA GCTGGACTAC CTGGACGAGG CCATCGACCG GCTCTCGACC
GAGATCGACC GGGTGATCGC CCCTTTCGCG GATGAGGTCG CCCTGCTGGA CACGATCCCC
GGGGTCGACC GCCGCATGGC CGAATGCCTG ATCGCCGAGA TCGGCGTCGA CATGACCGTC
TTCGGCTCCG CCGAACGGCT CGCCTCCTGG GCGGGCCGCT GCCCCGGCCA GCACGAGTCC
GCCGGCAAGT CCAAAGGCGG CCGGACCCGC AAAGGGTCGA AATGGCTGCG GATCTACCTG
CACGACGCCG CCCGCGCCGC CAGCCGCACC AAGAACAGCT ACCTCAACGC CCAATATCAC
CGGATCAAAG CCCGCCGCGG CCCCGCCAAA GCCAGGGTCG CCGTCGAGCA CTCGATCCTC
GTCGCCGCCT TCCACATGCT CGACCGAGGC GAGCCCTACC ACGACCTCGG CGCCGACTAC
TTCACCCGAC GCCGCGACCC CAACCGCCAC GCCCAACGCC TGATCAGCCA GCTCGACGCC
CTCGGCTACG ACGCGGTCAT CACCAGACGA ACCGACCAGC CCACCGACAC CAAGGCCGCG
TGA
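
The GC content field in the record is the percentage of G and C bases in a nucleotide sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted in the space-separated 10-base blocks used above; `gc_content` is an illustrative name, and the short input is only the first block of the gene sequence, not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence: 8 of 10 bases are G or C.
print(gc_content("GTGGAGCGGC"))  # 80.0
```

Passing the full gene sequence to the same function gives the gene-wide percentage.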
 
Protein sequence
MERQVARCAG LDVHKDEIVA CARISDPGGP GRVELHTFGT TTRELLALRD WLTGLGVTRV 
GMESTGVFWK APFHILEDAV AECWLLNARH LRNVPGRKTD AADAAWIAEL VEYGLVRPSF
VPPQPIRELR DLTRYRRAQI DERTREAQRL DKVLQDAGIK LSSVASDVLG KSGRAILDAL
VAGTTDPVVL AELAKGQLRK KIPALQEALT AFFTGHHAII IGEILSKLDY LDEAIDRLST
EIDRVIAPFA DEVALLDTIP GVDRRMAECL IAEIGVDMTV FGSAERLASW AGRCPGQHES
AGKSKGGRTR KGSKWLRIYL HDAARAASRT KNSYLNAQYH RIKARRGPAK ARVAVEHSIL
VAAFHMLDRG EPYHDLGADY FTRRRDPNRH AQRLISQLDA LGYDAVITRR TDQPTDTKAA
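
The protein sequence can be related to the gene sequence through translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal illustration, not the database's own pipeline: it builds the table from the standard NCBI amino-acid string, treats an initiator codon in the first position as methionine, and stops at the first stop codon. The name `translate` and the constants are illustrative.

```python
# NCBI-style codon ordering: first, second, third base each cycle over T, C, A, G.
BASES = "TCAG"
# Amino-acid assignments for translation table 11 (same as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))
# Initiator codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon encodes methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGGAGCGGC AGGTGGCGCG GTGCGCCGGT"))  # MERQVARCAG
```

The gene starts with GTG rather than ATG, which is why the initiator rule matters here: read as an internal codon, GTG would give valine, but the protein sequence begins with M.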