Gene Franean1_6777 details


Gene Information

Locus tag: Franean1_6777
Symbol:
ID: 5675090
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8248683
End bp: 8250653
Gene Length: 1971 bp
Protein Length: 656 aa
Translation table: 11
GC content: 72%
IMG OID: 641245626
Product: resolvase domain-containing protein
Protein accession: YP_001511017
Protein GI: 158318509
COG category: [L] Replication, recombination and repair
COG ID: [COG1961] Site-specific recombinases, DNA invertase Pin homologs
TIGRFAM ID:
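
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the gene length counts the stop codon. Both assumptions agree with the values reported here, but the record does not state them.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count of an unspliced CDS; a stop codon encodes no residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Start bp and End bp copied from the record.
length = gene_length_bp(8248683, 8250653)
print(length)                     # 1971
print(protein_length_aa(length))  # 656
```

Both results match the Gene Length and Protein Length fields, which suggests the listed gene sequence includes its terminal stop codon.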


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGGGC TGGTCGCGGG TAGCCCGGGC AAGGTGCAGG GCTGGCACCG TGACCGCCTT 
GCGGTGGTGT ACGTGCGTCA GTCCTCCCGG CAGCAGGTCG CCGATCATGG GGAGTCCACG
CGGCTGCAGT ACGGGTTGGT GGAGCGGGCG GTGGCGCTGG GCTGGCCGCG GGTCCGGGTG
CGGGTGATCG ATGAGGATCT GGGGCGCTCG GCGGCGGGTG CGCAGGACCG GCCGGGTTTC
CGGCGTCTGG TCACCGAGAT CTCGATGGGC CGGGTGGGGC TGGTGCTGGG GTTGGAGATG
TCCCGGCTGG CCCGGGCGGG CCGGGACTGG CATCAGCTGG TCGAGCTGTG TGGGCTGTCG
GGGACCCTGC TGGGGGACAC CGACGGGGTT TACGATCCGG AGGAGTACAA CGACCGGCTG
TTGCTGGGTC TGAAGGGGAC CATGAGCGAG GCCGAACTCC ATCTGATCAA GCAGCGGATG
GCGTCGGGGC GACTGGCCAA GGCCGCCCGG GGAGAGCTGG CGGTTCCGCT GCCGACCGGG
TATGTCCGCA GGCCCTCCGG TGAGGTGGCG TTCGACCCCG ACGAGCAGGT CCAGGCCGTC
GTCCGGCTGG TGTTCTCGCT GTTCGCCGAG CTGGGCACGG TGCATGCGGT GCTGCGTTTC
CTGACCGAGC ACCAGATCCA GATCGGGATA CGGGAACGGG CAGGGCCGGC CAAGGGCGAG
GTGGCCTGGC GTCACGCGCA CCAGACCGGG CTGGTCAACA TGCTGCGCAA CCCGGCCTAT
GCCGGGATCT ACGCCTATGG GCGCAGCCGC GTCGCGGTGC GCGGCGGGCC GCCGCGCGGC
GGCCGGGTCC ACACCGGCCC CGAAGGGTGG CTGGTGACGA TCCCGGGGCT GTTGCCCGCT
TATATCAGTG TCGAGCAGTA CCAGGCCAAC CTGGCCCGGA TGGCGGCCAA CCGGGCCCGG
GCGGAAAGCC TCGGCGCTGT CCGGGACGGC CCCGCGCTGC TGACCGGCCT GGTGGTCTGC
GGAGTCTGCC GCCGGCGGAT GGGCGTGGCC TACGAGGCCA GCCGCACCGG TGTGGTGCAC
CGCTACGTCT GCCAGCGCAA CCATCTGACC TACGGGGTGG GGCGCTGCCA GCAGATGGCC
GGCGCTTTCC TGGATGCCCA TGTGGTCGCC CAGGTGCTGG CCGCGCTGAC CCCGGCCGGG
CTGGAGCTGT CGGTGCAGGC CGCCGAGTGC GTCGAGCGGC GCCGGCAGGA GGTGGACCAC
ATCTGGCGGC AGCGGCTGGA ACGCGCCGAG CAGGCCTGCG TGCGGGCGCG GCGTCAGTAC
CAGCTCGCCG AGCCGGCGGG GAGGAATCGT TCCCCCAGCA TCCGTCCCGC CGCCCGCAAC
GCCCCATCCA ACGGATCCGC GGCGACGCCA CCGGCACCCG GCGCTTTCAG GACCAGCCCG
CCGGTGGACC TGCGGGCCCG GACCCGGTCC AGTTCGTGCA GCACCTGGCC AGCGGACCAG
TCCACCCCGG CCGGCCGGCC CTCCACCGTG CCTTCGGTGG TGCTCAGGGC CGCCCGGTTC
GGGGAGATGC TCAAAGTTGC GGCGCGGGGG CCTTCGGCCC GGTACGCCAG ACCCGGTGGA
GGGCGCACAG TCCCGTCCGC GGCCGGCCGG GACCGCCGCT GGCCCTGCAA GGAGGCGACC
AGTTCCCCGA CCCGGGCCGT GTATGTGGCC GGATCATGGG TGTCGCGGAA ATCCACCCAC
ACCCGCGACG CCAGAAACGC CGGCAGGTCA GCGTCTTTGA GGAGGACGGG GATTGTGAGG
CGTTGCTGGC CGGCGACCGC GCGGGTCAGC ATCGCCGCGT ACTCCTCACC CACCCACGGC
CGCGACAGCG CCTGCGGGCT GACGACCAGG ATCCCCGACG CCGAACCCAG GATGCCCGCG
TCGAGGCGGT GGGCGAGTAC ATCCCCGGCG TCGATCTCCC ACTCGTCGTA G
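
The GC content field can be recomputed from the gene sequence. The sketch below is a minimal illustration: the function name is hypothetical, and it assumes GC content means the share of G and C bases among all bases, with the 10-base block spacing of the listing ignored. The record does not say how its 72% figure was derived or rounded.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input in the same spaced-block layout as the listing.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence as one string gives a value that can be compared with the reported GC content.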
 
Protein sequence
MLGLVAGSPG KVQGWHRDRL AVVYVRQSSR QQVADHGEST RLQYGLVERA VALGWPRVRV 
RVIDEDLGRS AAGAQDRPGF RRLVTEISMG RVGLVLGLEM SRLARAGRDW HQLVELCGLS
GTLLGDTDGV YDPEEYNDRL LLGLKGTMSE AELHLIKQRM ASGRLAKAAR GELAVPLPTG
YVRRPSGEVA FDPDEQVQAV VRLVFSLFAE LGTVHAVLRF LTEHQIQIGI RERAGPAKGE
VAWRHAHQTG LVNMLRNPAY AGIYAYGRSR VAVRGGPPRG GRVHTGPEGW LVTIPGLLPA
YISVEQYQAN LARMAANRAR AESLGAVRDG PALLTGLVVC GVCRRRMGVA YEASRTGVVH
RYVCQRNHLT YGVGRCQQMA GAFLDAHVVA QVLAALTPAG LELSVQAAEC VERRRQEVDH
IWRQRLERAE QACVRARRQY QLAEPAGRNR SPSIRPAARN APSNGSAATP PAPGAFRTSP
PVDLRARTRS SSCSTWPADQ STPAGRPSTV PSVVLRAARF GEMLKVAARG PSARYARPGG
GRTVPSAAGR DRRWPCKEAT SSPTRAVYVA GSWVSRKSTH TRDARNAGRS ASLRRTGIVR
RCWPATARVS IAAYSSPTHG RDSACGLTTR IPDAEPRMPA SRRWASTSPA SISHSS