Gene Franean1_3000 details


Gene Information

Locus tag: Franean1_3000
Symbol:
ID: 5671383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3529772
End bp: 3530974
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 68%
IMG OID: 641241903
Product: glyoxalase/bleomycin resistance protein/dioxygenase
Protein accession: YP_001507323
Protein GI: 158314815
COG category:
COG ID:
TIGRFAM ID:
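
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the values listed) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3529772, 3530974)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```

Both results agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields of this record.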


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTCCAG AGCCAGTGAC CAGGCGGCCC TCGGCGACCG CAGACCTGGC GACGCGGCAG 
TCGACCGGTG TGCGGCCGTC TCTGCCCATC GCCGCGGGAG GCCTTGGCAG TGGCCATCCG
GGCCGGGCCA AGAGCCCTGT CGTGAAGGTG GTGGATCTGG CCTTCCTGGA GTTCGAACGG
CCGGACCTCG ACCGTTCCGA GGCATTCGCC CGAGATTTCG GGTTTGCCGT GGCGCAGCGG
ACACCGGACA CGTTGATGTT GCGGGGGATC CTGTCCGGGG GCCCGTGCAT GATGATCCGC
CGCAGTACGG CGGCGCGGTT CGTCGGGCCG GCGTTCGCGG CCGCTGACGC CACAGACCTG
AACCGGCTTG CCCAGGCCAC CGACGCAACG GTCCGCGACC TCGCTACGGC GGTCCCCGCC
TTCGGCGGTG GGATCCTCGA CGGTGCCACA GCCGTCGAGC TGCGTGACCC CACCGGACTG
CCGGTACGGG TGGTGCACGG AATACCGGAG CTGCCGGCGC TGGACGAACA ACACCCACTG
GTCTTGAATG TCGGATCCCA GACACCGAGA GTGAATCTGA CCCAACGCCC ACCCCGAGAG
CCGGCCCGTG TCCAGCGGCT GGGACACCTC GTGCTGGAGT CCCCGGTCTT CGGCCGCGCA
CTCGACTGGT ACCTGCAGAC TCTCGGCCTG ATCGTCAGCG ACTTCCTCTT CCTCGACGGC
CAGCGCGACC GCGGCCCGAC GATGGCGTTC ATCCGGTGCG ACCAGGGCCG CCGGCCGGTC
GACCACCACA CACTGGCGAT GCTGCTTGGC CCGAGCGGCG GCTACGTCCA CTCCGCATAT
CAGGTCAGCG ACCTCGACGC GCTTGCCGCC GGCGGCGAGT ACCTGCGGGA ACGAGGCTGG
CGACGCAGCT GGGGAATAGG CCGGCACATC CAGGGCAGCC AGATCTTCGA CTACTGGCGG
GACCCGGATG GCTTCCTGGT CGAGCACTTC ACCGACGGCG ATCTTTTCGA CGCCTCCACC
GAACCCACCT GGACGCCGAT GTCCGCCAGC GGGCTAGCCC AATGGGGCCC ACGCGCCACC
ACCGACTTCC TCGGAACCCG GCCATCACCA CGGCTGCTCC ACACCATCTT CACAGCGCTG
CGCGGCGACA ACGAGATCGA CCTCGCCCGC ATCAAAGGCC TGAAGAAAGC GATGAGCCGA
TGA
 
Protein sequence
MVPEPVTRRP SATADLATRQ STGVRPSLPI AAGGLGSGHP GRAKSPVVKV VDLAFLEFER 
PDLDRSEAFA RDFGFAVAQR TPDTLMLRGI LSGGPCMMIR RSTAARFVGP AFAAADATDL
NRLAQATDAT VRDLATAVPA FGGGILDGAT AVELRDPTGL PVRVVHGIPE LPALDEQHPL
VLNVGSQTPR VNLTQRPPRE PARVQRLGHL VLESPVFGRA LDWYLQTLGL IVSDFLFLDG
QRDRGPTMAF IRCDQGRRPV DHHTLAMLLG PSGGYVHSAY QVSDLDALAA GGEYLRERGW
RRSWGIGRHI QGSQIFDYWR DPDGFLVEHF TDGDLFDAST EPTWTPMSAS GLAQWGPRAT
TDFLGTRPSP RLLHTIFTAL RGDNEIDLAR IKGLKKAMSR