Gene Francci3_3805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3805 
Symbol 
ID3905553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4562297 
End bp4563487 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content70% 
IMG OID637881131 
ProductDNA end-binding protein Ku 
Protein accessionYP_482884 
Protein GI86742484 
COG category[S] Function unknown 
COG ID[COG1273] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02772] Ku protein, prokaryotic 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.683793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.320133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTGCGA CCTGGAAGGG CGTGATCTCC TTCGGCCTGG TGTCCATCCC GGTGAGGCTC 
TACTCGGCGA CCCAGGAACG GGATGTCGCG TTTCACCAGG TTAGACGGTC GGATGGGTCC
CGTATCCGAT ACCGGAGAGT GGCGGAGGCC GACGGGGACG AGGTCAACTA CGCCGACATT
GCCAAGGGCT ACGAGCTGCC GGATGGGGAG ACCGTCGTAC TCACCGACGA GGACTTCGCG
AATCTCCCGC TGTCCACCTC CCGGGCGATC GACGTGCTGG AGTTCGTCCC GTTGGAGCAG
GTCGACCCGA TCTACTTCGC GAAGAGCTAC TACGTCGAGC CCGACCGCAC CGGCGCCAAA
CCCTACGTTC TCCTTCGCGA CGCGCTGGCC GCATCGGGCC GTGTCGCGTT GGTGAAGATA
GCCCTGCGGC AACGCGAGCA GCTCGCGACG CTCCGGGTGC GCGGCGGCGT CTTCGTCCTG
GAGACCATGG TCTGGCCGGA CGAGGTGCGG CAACCGGACT TTCCGTTCCT TGAGGAGGAC
GTCGCGGTAC GCCCGCAGGA GCTGTCGGTG GCTGCCTCGC TCATCCACAC CCTGGCAGCC
GACTTCGACC CCACCAGGTA CACCGACAAC TACCGTGAGG CCCTGCAGGC CGTGATCGAT
GCGAAGGTCG CCGGCCGCGA GGTTGTGGCT TCGCCGGGCG GGCCGGCCAG CGAGGCGGTC
GGCGACCTCA TGGCGGCGCT GCGCGCGAGC ATAGCCGCCG CCCGCGCTGG ACGGCCCGGC
GAAGCGGCTG TGGCAGGTGG TGCGGCTGTG GCAGGTGGTG CGGCTGTGGC AGATGGTGAC
GCGGGGCCCG CGGCGGCCGG GGTCACCGAC GAGGGGCCCG ACGACAAGGC GTCCGACGAC
AAGGCGTCCG ACGACAAGGC GTCCGACGGA CGGCGGGGCG GCCGTACCTC CTCGGTCAAG
GGTGCCTCGT CGGCCCCGGG TACGCGGTCG ACCGCCCGGA AGACGCCGTC CTCGACGAGG
AGCACCGCGA AGACGAACGC CGCCACGAAG ACGCCGCCCG CGAAGACCTC CGCGGCCAAG
GCCTCCGCGG CCAAGACCTC CGCGGCCAAG GCCACCTCCT CGAGGACGGC CCCGAAGACG
GCCCCGAGGA CGCCGACCTC GAAGACGCCC CCGACGCGGC GATCCGCCTG A
 
Protein sequence
MRATWKGVIS FGLVSIPVRL YSATQERDVA FHQVRRSDGS RIRYRRVAEA DGDEVNYADI 
AKGYELPDGE TVVLTDEDFA NLPLSTSRAI DVLEFVPLEQ VDPIYFAKSY YVEPDRTGAK
PYVLLRDALA ASGRVALVKI ALRQREQLAT LRVRGGVFVL ETMVWPDEVR QPDFPFLEED
VAVRPQELSV AASLIHTLAA DFDPTRYTDN YREALQAVID AKVAGREVVA SPGGPASEAV
GDLMAALRAS IAAARAGRPG EAAVAGGAAV AGGAAVADGD AGPAAAGVTD EGPDDKASDD
KASDDKASDG RRGGRTSSVK GASSAPGTRS TARKTPSSTR STAKTNAATK TPPAKTSAAK
ASAAKTSAAK ATSSRTAPKT APRTPTSKTP PTRRSA