Gene Francci3_4210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4210 
Symbol 
ID3907175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5026684 
End bp5027664 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content63% 
IMG OID637881537 
Productintegrase 
Protein accessionYP_483286 
Protein GI86742886 
COG category[L] Replication, recombination and repair 
COG ID[COG2826] Transposase and inactivated derivatives, IS30 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGTCG AGGATCGGGA GCTGATTTCC CGGGAGCTGA GCAGGAACCG GTCGGCGCGC 
TTCATCGCGA AGGCTCTCGG CCGACATCAT TCGACGATCT CGCGGGAGAT CGAGCGTAAC
GGCGGCGAGA GCGCCTACCG GGCGGTGGAC GCGCAGGCGC GGTGTGACGC GATGCGAAAA
CGTCCGAAGG AACGCAAACT CGTCGCGTCG GCGGCCCTAC ATGACGCGGT CAACGCGGCC
CTGGTCGAGA AATGGTCACC GAAACAGATC AGTGAGAGAC TGGAAAAGGA CTTTCCCGAT
GACGAGAGTA TGCGCGTGTC GCACGAGACG ATCTACGAGT GCCTCTATCT GCAGGCCCGC
GGCGAGCTGC GGACCCAGCT GACGATCGCG CTGCGCAAGG GGCGGGCCAG GCGGGTGAAC
CGATCCCGGA CAGCTGTGGC CCGCGGGCGG ATCGTCGACA TAGTCAACAT CAGCGAGCGG
CCGAAGGAGG CCGAGGACCG CGCCGTGCCC GGGTTCTGGG AAGGTGATCT CATTCTGGGC
AAGGGGAATA AGTCCCAGAT CGCGACGCTG GTCGAGCGTA CCACGCGGTT CGTCATGCTC
GTACGTATTC CCTACGACCG TAATGCCGAG AAGGTCGCCT ACCTGCTGGC CCGGAAGATG
GAAACCCTGC CCGACTTCAT GAAGAAGTCC GTCACCTGGG ACCAGGGCAA GGAAATGGCC
CGGCACGCGA AGTTCACCGT CGCCACCGGC ATGCCCGTCT ACTTCTGCGA TCCGCACTCG
CCGTGGCAGC GCGGCTCGAA CGAGAACACC AACGGACTGC TGCGCCAGTA TTTCCCGAAG
GGCACCGACC TGTCGTTGCA TACCCAGGCC GAGCTTGATA AGCTGGCTGA GCAGTTGAAT
GGGCGACCGC GGCAGACGCT GGGATGGGCG AAGCCAGTCG AAGTCTTCAA TGATCTGCTG
GCAAATCATG CGTCGCTATG A
 
Protein sequence
MTVEDRELIS RELSRNRSAR FIAKALGRHH STISREIERN GGESAYRAVD AQARCDAMRK 
RPKERKLVAS AALHDAVNAA LVEKWSPKQI SERLEKDFPD DESMRVSHET IYECLYLQAR
GELRTQLTIA LRKGRARRVN RSRTAVARGR IVDIVNISER PKEAEDRAVP GFWEGDLILG
KGNKSQIATL VERTTRFVML VRIPYDRNAE KVAYLLARKM ETLPDFMKKS VTWDQGKEMA
RHAKFTVATG MPVYFCDPHS PWQRGSNENT NGLLRQYFPK GTDLSLHTQA ELDKLAEQLN
GRPRQTLGWA KPVEVFNDLL ANHASL