Gene Franean1_4258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4258 
Symbol 
ID5672613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5080178 
End bp5081509 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content72% 
IMG OID641243131 
ProductHNH endonuclease 
Protein accessionYP_001508548 
Protein GI158316040 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.137511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.228597 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACCG CGCGGAGTGT GCAGGTCGAC GGGGACGGTT CCAGTCCGGT TGACGTGCCG 
GAGCAGGAGA GAACGTTCGA GGGGCGGGTG CGTGGGCTGC TGGTGCGGAT CGGTGCGGCG
GTCCGGTCGA TAGCGGCGGG GAACGCGGAC CTGCTGGGGC TGTTGGCGCA GTTCGCCGAC
CTGCGGCCCC CGGCGGCCGG CCGGGAGGTT CTGTCCGACG AGTTCGCTCC GGAGGATGTC
GCAGCGGTGC GGGGGGTGTC CCCGCAGGCC GCGGCGAGTC AGATGCTGTT CGCGTGCACG
GTGGCGCGCC GGCTGCCCGC CGCGGTGGAG GCGTTGAAGG CCGGGGTGCT GGACGTGCAG
CGGCCGCGTT CGTTGGAGAA CGCGGTCCGT CCGCTCGACG GTCCGCTCGC GGCGCAGGTC
GAGGCCCGTG TGCTGGCCGG GGGTGCGCGG CCGACCCGGG GGGCGTTCAC GGATGCGTGC
CGTCGTGCCG TGCACACGGT GGACCCGGCT GGGGCGGCCG AACGCGCGCG GGCCCGGAAG
AAGGAACGGC GGGTGTGGGT CTCGCCGGGG GAGGACGGGA CGAGTTGCCT GTCGGCGGTG
CTGCCCGCCG ACGAGGCGAC CGCCTGCTAC CAGCGGGTCG ACCAGATCGC CCAGGGAATC
GCCGCGCACC GAGGCGGCGG GGACACCCGC AGCCGTGACC AGATCCGCGC GGACGTCCTC
GTCGATCTCC TGTGTGGGCG GGTCGCGCAT GCGGTGCCGC TGCCGTGTGA GGTGCAGGTC
GTGGTGCCGG TGACGGTGCT GCTGGGGTTG GCGGAGGATC CCGGGGAGAT TCCCGGGTAC
GGGCCGGTTC CCGCCGCGGT GGCCCGGGAG ATGGCCGCAC GGCCGGGGTC GACATGGCGG
CGGATCCTCG CCGACCCTCA GGGCACGCTC GTCGAGATCG CGGACCGGCG CCTACCGACC
GCGGCCCAGG CCCGGCACGT GCGGGCACGG AACCGTAGCT GTGTCTTCCC GGGCTGCGCC
CGTACGTCGC GACGCGCGGA CATCGACCAC ACGGTGGCAC ACGTGAGCGG CGGGCCGACG
CTCACCCGGA ACCTCGGGCC GATATGCCGC AAGCACCACC GCATGAAGCA CTCCGGCCGT
TGGCGGCTGA CACAACCGCG GGAAGGAACG TTCGTCTGGA CGGGTCCGTT CGGGGCGACG
CTCGTCACCC ACCCACATTC ATACATCGAA CCGCAGAACA AGGCCGGTAC GACGGGAGGG
GGTGGTGATG AACCGTCGGG CAACACCGCC TCGGGGTGGA AAATACCCCA CGACACACAG
CCACCCTTCT AA
 
Protein sequence
MNTARSVQVD GDGSSPVDVP EQERTFEGRV RGLLVRIGAA VRSIAAGNAD LLGLLAQFAD 
LRPPAAGREV LSDEFAPEDV AAVRGVSPQA AASQMLFACT VARRLPAAVE ALKAGVLDVQ
RPRSLENAVR PLDGPLAAQV EARVLAGGAR PTRGAFTDAC RRAVHTVDPA GAAERARARK
KERRVWVSPG EDGTSCLSAV LPADEATACY QRVDQIAQGI AAHRGGGDTR SRDQIRADVL
VDLLCGRVAH AVPLPCEVQV VVPVTVLLGL AEDPGEIPGY GPVPAAVARE MAARPGSTWR
RILADPQGTL VEIADRRLPT AAQARHVRAR NRSCVFPGCA RTSRRADIDH TVAHVSGGPT
LTRNLGPICR KHHRMKHSGR WRLTQPREGT FVWTGPFGAT LVTHPHSYIE PQNKAGTTGG
GGDEPSGNTA SGWKIPHDTQ PPF