Gene Franean1_5687 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5687 
Symbol 
ID5674013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6904851 
End bp6906854 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content78% 
IMG OID641244540 
Productadenine deaminase 
Protein accessionYP_001509943 
Protein GI158317435 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.354832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTG AGCGGGACGT TGACCGTACG GTCGACGCCG TTCCCTCCCA CGCGCTCCCG 
GCCGCACCCG TGCCGGTGAC ACCAGGGCCG GCCGCTCCGG TCCCGATGAC ACCGGTCCCG
ATGACACCGG TCCCGATGGC GCCGGTCCCG ATGGCTCCGG GGGTGGTGCC TGTCGGGCCG
TTGCGGTCGC TTCCGCCGCT CGCCGCGCCC GCGGTGGCGC GGGTGCCCGG CCGCCGCCGG
GCGACGCCTG TCCCCGGCCG CACCGGCGTC GTGCGCGGCG GCCGGGCCCC CCGCCCGGTG
ACGGTGCCCA CTCCGCTGCG CGAGCCCCTC ACCCCCACCC CGGACGAGCT CGAGCGGCTG
CGCGACATCG CCACCGGCCG CGCCGAGGCC GACTTCGCGA TCCGTGGCGG GGTGGTCTTC
GTCGTCCAGA CCGGCGAGAT GATGCGGCGC GACATTCTCA TCGCCGGCCG CTACATCGCC
GCGGTGACCC GGCCGAGCGC CGTGGGCGCC CGCCGCAGCC TGGACGCGGC CGGCCGGTTC
GTGCTCCCGG CCTACGTCGA CGCCCGCGCG TCGGTCGAGC GGACGCTGCT CGGCCCGGGG
GAGCTCGCCC GGCTCGTCGT CCCGCGCGGC ACGGTGACGG TGCTGACCGA CCTCGCCGAG
ATCGAGCAGA TCCACGGCCC GCGCGGCCGC TCGCTCGTGC TGCGCACCGG CACACCGCTG
CGTGTCCTGC CCCGCGGCCC CGCCGGCACC CGCGACGGGC TCGCCGGCCC GCCCCGTCCC
GCCGCGCCGC ACGAGGGCAG CGCGCCCTAC GACGCCACCG GGCCCTACGA CGCCACCGGG
CCATATGAAG CGACCACGCC GTACGAGGCC ACCGCGCCGC ACCAGGCCTC CGTCGAGGCC
GTGGACGTCG CGCGTGCCGT CCCCCGGCAG GTGAGCCCGC TGACGGGGGT GCCCGGCCCG
TTCATCGGGA GCGCCGGGCC CGCCGCGTTC CTGCCGGGCG CCGCCCATCT CGTCGGCGGG
CCCGCCGGAG CCGAGATGCC GCCGGTGGGT GGCGGCCTGA TCTCCGATCA CCTCGACGAC
CGGGTACGGG AGGAGATCCG AGCGGGCGGC GCCGCCGTCC GTGTCGTCCG GGACGCCACC
CTCACCCCCG CGCGTGCCTT CGGCCTCGAC CACGTGCTCG GCTCGATCGC TCCGGCCCGG
CTCGCCGACC TGCAGATCGT GCAGGACCTG ACCGCGCGGC TCCCCCCGGA CGTCGTGGTG
GCCGGTGGGC GGATCGCGGC CGAGCGCGGC CGCGCGCTGT TCGACAACTT CGACGTGGCC
CCGACGTGGG CGTCGTCCTC GGTGCGCCTG CCCGCCGGCC TCTACACCGG TTCGTTCACC
TCACCCTGCA TGCACTGGAG CGGCCGGCGC GACACCCGGC TCACCGTCGT CGACGTGCGG
GCCCCCGCCG TCGCCACCAG CCCGGCGGCC TCGATCGGGG TTTCGGGGGT CTCGGCGTCC
TCGGTCGGGC TCGGGCCGCT CGGCGCGGGC GGGGAGCCGC CGCGGGTGAC CGTCGCCCGG
GTGGAACCGA CCCTGCGCGA CGGCGTCGTC GTCGCCGACC CGTCCCGAGA CCTGCTCAAG
TCGGCGGTGC TCGACCGGTC CGGCCGGGCC GACCGCGTGC GGGTCGGGAT GGTTCGCGGC
CTCGGGCTCA CCCGGGGGGC GCTGGGGGCG ACCGCCGGGG CCGCCCCCGG TGACGTGGTG
ATCGTCGGTG CCGGCGACGC GGACATGCTG ACCGCGGCCC GGGCGTTGGA GGGGATGGGG
GGCGGGTTCG TCGTCGTCGA GCGTGGCTGG GTGCTGGCCG CCTGCCCGCT GCCGGTGGCG
GGCCTGATGA GCGACGCCCC CTGGGAGGCG GTGCTCGGCC AGCTCGCGGC CGTCGACACC
GCGGCCCGCG ACCTCGGCTG TCGGCTCGCC TCGCCGCTTC GTCTGATCGC CCAGCTCGGC
AGGGAGCTGT ACGTCCGGCC CTGA
 
Protein sequence
MTAERDVDRT VDAVPSHALP AAPVPVTPGP AAPVPMTPVP MTPVPMAPVP MAPGVVPVGP 
LRSLPPLAAP AVARVPGRRR ATPVPGRTGV VRGGRAPRPV TVPTPLREPL TPTPDELERL
RDIATGRAEA DFAIRGGVVF VVQTGEMMRR DILIAGRYIA AVTRPSAVGA RRSLDAAGRF
VLPAYVDARA SVERTLLGPG ELARLVVPRG TVTVLTDLAE IEQIHGPRGR SLVLRTGTPL
RVLPRGPAGT RDGLAGPPRP AAPHEGSAPY DATGPYDATG PYEATTPYEA TAPHQASVEA
VDVARAVPRQ VSPLTGVPGP FIGSAGPAAF LPGAAHLVGG PAGAEMPPVG GGLISDHLDD
RVREEIRAGG AAVRVVRDAT LTPARAFGLD HVLGSIAPAR LADLQIVQDL TARLPPDVVV
AGGRIAAERG RALFDNFDVA PTWASSSVRL PAGLYTGSFT SPCMHWSGRR DTRLTVVDVR
APAVATSPAA SIGVSGVSAS SVGLGPLGAG GEPPRVTVAR VEPTLRDGVV VADPSRDLLK
SAVLDRSGRA DRVRVGMVRG LGLTRGALGA TAGAAPGDVV IVGAGDADML TAARALEGMG
GGFVVVERGW VLAACPLPVA GLMSDAPWEA VLGQLAAVDT AARDLGCRLA SPLRLIAQLG
RELYVRP