Gene Francci3_4274 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4274 
Symbol 
ID3907241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5102798 
End bp5104021 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content66% 
IMG OID637881600 
Productphage integrase 
Protein accessionYP_483349 
Protein GI86742949 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.173259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAGT CGAAACAAAA GCGGACTCGC GCGAACGGTG AGGGATCGAT TTACCCGTAC 
CGGAATGGCT TCGCCGCGTA CGTGTGGGTG ACCACACCTG ATGGGAAGTC ACGGCGGAAG
TACGTGTACG GGCAGACCAG GGAGATCGTT CACGAAAAGT GGATCAAACT ACACTCGACG
GCTCGACAGG GACCCATGCC GACACGATCG GTGACGGTGG CGGTATTCGT TGCTCGGTGG
CTGAGTGAGG TGGTGGAGCC GAACCTTGCT CCGCTGACGT ACTCGACATA CGAGACGCTG
GCGAGGCTGT ACATCGTTCC GGGTCTGGGT GCCAAGCGGC TCGACCGGCT GACGGTCCGG
GACGTACAGA AGTGGGTCAA CGGGCTTCAG CGGGCCTGCC AGTGCTGCGC ACAGGGCAAG
GATGCTCGGC GGCCGGAGAG GCGGCGTCGG TGCTGTGCGC TCGGCCGGTG CTGTGGGCAG
ACCATCTCCG CCCGGACACT GAAGGACGTT CGGGGCGTGC TGCGGTCCGC GTTGACGCAC
GCTGGTCGCG AGGAGTTGGT GTCCAAGAAC GTTGCCGGCC TGGTCAAGGT TCCGAAGGTC
AGGGCGCGAC GTCGGAAGGC GTGGACCACG GATGAGGCGC GGATCTTTCT GGAGTCGGCC
CGTGGTGATC GGTACTACGC CGCGTACGTC CTGATCGTCG TTCTCGGCTT CCGTAAGGGG
GAGGCGCTGG GGCTGCCGGA CGTGACCGAT GACGGGCCGG AGGAGTTGGC GGTGGAGTGG
CAGCTTCAAC GGGTCCGGGG GCAGCTTCTG CATCGAGAGA CGAAGACGGC GGGATCGGAC
GCCACGCTGC CCCTGCCGCA GATCTGCCGT ACGGCGATCG CTGAGAGGCG GAGGCTGCGG
GCGGAGGATC GGAAGGCTGC GGGGGCTGCC TGGCAAGAGT CAGGACTGTT CACCACGGGA
CGCTTCGGCA CCGCGGTGGA GCCTCGGACG TTCGACCGGG CGTTCGCTCT GCGGGTACAG
AAGGCGGGGG TGCCTCGGAT CACAGTGCAC GACGCCCGGC GGACCTGTGC GTCGCTGCTG
GTGGATCTCG ACGTGCACCC ACGGGTGATC ATGCGCATCC TGCGGCACGC GAACATCGAC
GTGACGATGG AGATCTACGC GCAAGCGTCG TCCACGGCCA CGCGGGAGGC GTTGAACCGG
CTAGGCGAAA GTCTTGATCG GTAG
 
Protein sequence
MNESKQKRTR ANGEGSIYPY RNGFAAYVWV TTPDGKSRRK YVYGQTREIV HEKWIKLHST 
ARQGPMPTRS VTVAVFVARW LSEVVEPNLA PLTYSTYETL ARLYIVPGLG AKRLDRLTVR
DVQKWVNGLQ RACQCCAQGK DARRPERRRR CCALGRCCGQ TISARTLKDV RGVLRSALTH
AGREELVSKN VAGLVKVPKV RARRRKAWTT DEARIFLESA RGDRYYAAYV LIVVLGFRKG
EALGLPDVTD DGPEELAVEW QLQRVRGQLL HRETKTAGSD ATLPLPQICR TAIAERRRLR
AEDRKAAGAA WQESGLFTTG RFGTAVEPRT FDRAFALRVQ KAGVPRITVH DARRTCASLL
VDLDVHPRVI MRILRHANID VTMEIYAQAS STATREALNR LGESLDR