Gene Franean1_2759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2759 
Symbol 
ID5671148 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3265314 
End bp3267140 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content66% 
IMG OID641241668 
ProductATP-dependent OLD family endonuclease 
Protein accessionYP_001507088 
Protein GI158314580 
COG category[L] Replication, recombination and repair 
COG ID[COG3593] Predicted ATP-dependent endonuclease of the OLD family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCT GTCGTGTGAC CCTGCGCCAC TTCCGTGGCG TTGAGGCCGG GACGGTGTAT 
CTCGACGGGG ACACGTTGCT GGTCGGGTCC AACAGCGTGG GCAAGTCCAC CGTGTGCGAG
GCGCTGGATC TGGTTCTCGG CCCGGAGCGG ATGTTCCGCC GGCCTGTGAT CGACGAGTAC
GACTTCTTCG CCAGCCAGTA TCAGGACGTG GACGGATCTC TCCCCGAGAT ACGCATCGAG
GTGGTCCTGA CTGACCTGAC GCCGGAAGCG CGGCGGCGTT TCCACGGCCG CATGCGTCGC
TGGTCACCAC GCCGACGTGA CTTCGCGGAG TCGGCGCCCG CTGGCGACGA GGGGGACAGC
GAGGACGTGT GGTGCCTGCC CGTCGTGTTC CTGGGCCGGT TCAACCCGGA CGAGGACGAC
TTCGAGGGCA GCACGTTCTT CGCGCATCCG GAACCCGTGG TTGACGACCT CACCGATGAA
TCCACCGAGC TGGGCGGTGG GTTGAAGCCG TTCACCCGTG AGGACAAGCG GCACTGTGGC
TTCCTGTACC TGCGTCCCAA CCGTACGGGC AGCCGCGCGC TCAGCTTCCA GCGCGGTTCG
CTACTCGACA CCATCGTCCG TCTGGAGGCG GAGTCCACCG GCCAGCTGTG GGAGACGGCT
CTCCGCGACG TTGAGGAGGT GGTGATCGCT GGGGACGATT CCGCGTTCTT CAAGGTCCGT
GAGCAGCTCC GTGCCCGCAT CGAGCGGTTC CTCAGCCTGA GTGACGGCCC CGGCGCGGTG
GACGTCCGCG TCTCAGAGCT GACGCGGGAC CATCTCCGCG AGGTCCTGCG GCTGTTCCTG
TCTACCCAGC CCGGAGCGCA TGGTGTGCCG TTCAACCGGC TAAGCACGGG CTCGTTGAAC
CTGCTCGTGT TCGCGCTGCT GACCTACATC GCCGAGCTGA AGGGCGACGA CTCGGTGATC
TTCGCGATGG AGGAGCCGGA GATCGCGCTG CCGCCGCACG CCCAGCGCCG GCTGGTCGAC
TTCGTCGTCA GCCGAATGGG GCAGGCGATC ATCACGTCGC ACTCGCCGTA CGTGATCGAG
AAGTTCGACC CCGGCCAGAT CGTCGTCCTG AGCCACGACA GCACCGGCAC GCTCACCAGC
ACCCCCATCA GTCTCCCGGA CGACTTCAAG CCGAAGAAGT ACCGCAACAA CCGGCGTCAG
TTCGCCGAGG CTGTACTGGC CCGCGCTGTC CTCGTTGTGG AGGGCGCCAC GGAGGTGCTG
GTCTTCCGGG CCGTCTCCGA CGTCCTCGAC CGCGACGAGA CCGCCGTCGG CTACCTGCAC
CTGGACCTGG CCGGTATCTC GATCTTCGAT GCCGAGAACG ACGTGTCAGT CCCGGTGTTC
GCCCCGGTCT TCGCCGGCAT GGGCAAGAAG GTCTTCGGCA TACACGACAC TCTGAACAAG
CCTCTGGAAC CCGACGTGGC AGCGAAGACG AGCCGGTTCA CCCGGTACGT GGTCATCCCT
TACAAGGGGA TCGAGGCGTT GCTGGTCACC GAGACACCCA TCGCTGTCCA GAGGCGGTTC
GCTGTGTCCG TGACGACCCG CCTCGACTGC CCAAGGGGGG TGGGTCAGCT GGCGACGACG
GCGACCGATG AGGAGGTCCG GTCACACGTC CGGGAACTTC TGCAAACGCA TAAGGGGTCG
AACGGCTACG CGGCTCTGCT GATCGCTGAG TGCTCCGGCA AGGCCGAGCT GCCGCCCACC
CTTGCCGCCT TCCTGACCTC CATCGACGCG GACCTCCGCC AGGAGCCGCC GGAGTCCGCT
GACGTGGTAG CCGTCCCGGC GATCTGA
 
Protein sequence
MQVCRVTLRH FRGVEAGTVY LDGDTLLVGS NSVGKSTVCE ALDLVLGPER MFRRPVIDEY 
DFFASQYQDV DGSLPEIRIE VVLTDLTPEA RRRFHGRMRR WSPRRRDFAE SAPAGDEGDS
EDVWCLPVVF LGRFNPDEDD FEGSTFFAHP EPVVDDLTDE STELGGGLKP FTREDKRHCG
FLYLRPNRTG SRALSFQRGS LLDTIVRLEA ESTGQLWETA LRDVEEVVIA GDDSAFFKVR
EQLRARIERF LSLSDGPGAV DVRVSELTRD HLREVLRLFL STQPGAHGVP FNRLSTGSLN
LLVFALLTYI AELKGDDSVI FAMEEPEIAL PPHAQRRLVD FVVSRMGQAI ITSHSPYVIE
KFDPGQIVVL SHDSTGTLTS TPISLPDDFK PKKYRNNRRQ FAEAVLARAV LVVEGATEVL
VFRAVSDVLD RDETAVGYLH LDLAGISIFD AENDVSVPVF APVFAGMGKK VFGIHDTLNK
PLEPDVAAKT SRFTRYVVIP YKGIEALLVT ETPIAVQRRF AVSVTTRLDC PRGVGQLATT
ATDEEVRSHV RELLQTHKGS NGYAALLIAE CSGKAELPPT LAAFLTSIDA DLRQEPPESA
DVVAVPAI