Gene Franean1_6280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6280 
Symbol 
ID5674599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7627254 
End bp7628261 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content74% 
IMG OID641245132 
Productintegrase family protein 
Protein accessionYP_001510528 
Protein GI158318020 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.737923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGACC CTGCCGGAAC CCTCGCCCCG GTTCCGTCCG TTGACCTCGA CATCGTCGGT 
GCCCGGGACG CGAGCGTCTC CGCGACCACC GCCCGCATGA TCGACGACGC GACCGCAGCG
AACACCAAGC GCGCGTACGC CAGACAGTGG TCGACCTTCA CGGCCTGGTG TTCCCAGGAG
GGCCGGACCA TGCTCCCGTG CTCGGATGCC ACGCTCGCCG AGTACGCGGC GCAGCTCGTC
ACCGCCGGGG CCGGGCCGGC CACCGTGGAA CAGGCCATTG CCACCATCCG CCGGGTCCAC
CGCGACCAGG CCGCCACCCC GCCCGACACC CGCGCCGCAC GGCTAGTGCT GCGCACAGCC
CGCCGGGAAC GCGCCGACGC CGGGCAGGCC ACCCGACAGG CACCGCCAGC CGCGCTGGAT
CAGCTCCGCG CCATGCTCGC CGCCTGCGAC ACCAGCACCC GCGGAGTCCG TGACCGGGCC
CTGCTGCTGC TCGGCTTCGC GATGATGGGC CGCCGTTCGG AGCTCGCCGC GCTCGACCTG
GCCGACATCC GCGAGGTCGA CGAGGGCCTG ATCGTCGTCG TCCGCCGGTC GAAGACCGAC
CAAGACGGCC GCGGCGCCGA GGTCGCCGTT CCCTACGGCA GCCGGCCGGA CTCGTGCCCT
GTGCGCGCGG TCCGGGCCTG GCGCGGCTGG CTCGCCTCCA TCGGCATCAC CGAGGGCCGA
CTGTTCCGTT CCGTAACGAG GCACGGCCAC ATCGGCGAGG CCATGTCCGG CGACGGGATC
CGCCGAGCCG TCCGCGCCGC CGCCGTCCGG GCCGGCCTCC CGAACGCGGA CGTCTTCTCG
GCTCACTCGC TGCGGGCTGG AGGGGCGACA GCCGCGGCGA AGGCCGGCGC CCCGGTCGCC
GCGATCGCCC GGCAAGGCCG CTGGTCACCC ACCTCGCCCG TGGTGCACTC GTACATCCGG
GCAGCCGACC GATGGCGGGA CAACCCGATG GCCAGTGTGG GCCTGTGA
 
Protein sequence
MIDPAGTLAP VPSVDLDIVG ARDASVSATT ARMIDDATAA NTKRAYARQW STFTAWCSQE 
GRTMLPCSDA TLAEYAAQLV TAGAGPATVE QAIATIRRVH RDQAATPPDT RAARLVLRTA
RRERADAGQA TRQAPPAALD QLRAMLAACD TSTRGVRDRA LLLLGFAMMG RRSELAALDL
ADIREVDEGL IVVVRRSKTD QDGRGAEVAV PYGSRPDSCP VRAVRAWRGW LASIGITEGR
LFRSVTRHGH IGEAMSGDGI RRAVRAAAVR AGLPNADVFS AHSLRAGGAT AAAKAGAPVA
AIARQGRWSP TSPVVHSYIR AADRWRDNPM ASVGL