Gene Franean1_5933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5933 
Symbol 
ID5674254 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7205853 
End bp7206947 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content69% 
IMG OID641244781 
Productadenosine deaminase 
Protein accessionYP_001510183 
Protein GI158317675 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1816] Adenosine deaminase 
TIGRFAM ID[TIGR01430] adenosine deaminase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.2838 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0016274 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGGACATCC CTGCTGGCCA GGCCAGCGAG AAGATCACTG AGGCTGCGAT CCGCCGGGTT 
CCCAAGGTCC TGCTGCACGA TCATCTTGAT GGTGGCCTGC GGCCCGCGAC CATCGTCGAG
CTGGCTGACG CCACCGGGTA CACCAGGCTG CCGACGACCG ATGTCGACAA GCTCGGTACC
TGGTTCCGCG GAGGTGCGCA CACCGGGTCG CTCGTGCGGT ACCTGGAGAC GTTCAGCCAC
ACGGTGGGCG TCATGCAGAC GCCCGAGGCC GTGGCCCGGG TGGCCCGCGA GTGCGCCGAG
GATCTGGCCG CCGACGGGGT CGTCTACGCG GAGGTCCGGT TCGCCCCGGA GCTCCACGTC
GAGCAGGGCA TGTCGCTCGA CGAGGTGGTC GAGGCCGCGT TGGACGGCTT CCGCGCCGGC
TCGGCGGGAA CCGGCCTGCA CGTGCGCGCG CTGGTGACCG CCATGCGCCA CCAGGCCCGC
TCGTTGGAGA TCGCGGAGCT GGCCGTCCGG TGGCGGGAGG CCGGGGTGGT CGGGTTCGAC
ATCGCCGGGG CGGAGGCGGG CAACCCGCCG ACCCGCCACC TGGACGCGTT CCAGTACATC
CAGCGGGCGA ACGGGCACTT CACGATCCAC GCGGGTGAGG CGTTCGGGCT GCCGTCGATC
TGGGAGGCGC TGCAGTGGTG CAACGCCGAC CGGCTGGGGC ACGGCGTGCG CATCGTCGAC
GACATCACGG TCGACCCGGA CGGAAACGCC ACCCTCGGGG ATCTGGCCGA CTATGTGCGC
GATGTCCGTG TCCCGCTGGA GATGTGCCCG TCGTCGAACG TGCACACCGG GGCGGCGCCG
AGCATCGAGC GCCATCCGAT CGGTCTGCTG CGCAGGCTGC ACTTCCGGGT CACGGTGAAC
ACCGACAACC GGCTGATGAG CGGGGTGACG CTGTCCAGCG AGTTCGCGAC CCTGGTCGAG
ACGTTCGGCT ACGGCTGGTC CGACATCCGC TGGCTGACCG TGAACGCGAT GAAGTCTGCC
TTCCTCCCAT TTGACCAGCG TCTCGCGCTG ATCAACGAGG TCATCAAGCC CGGCTTCGAG
GGGCTCGTCC CGTGA
 
Protein sequence
MDIPAGQASE KITEAAIRRV PKVLLHDHLD GGLRPATIVE LADATGYTRL PTTDVDKLGT 
WFRGGAHTGS LVRYLETFSH TVGVMQTPEA VARVARECAE DLAADGVVYA EVRFAPELHV
EQGMSLDEVV EAALDGFRAG SAGTGLHVRA LVTAMRHQAR SLEIAELAVR WREAGVVGFD
IAGAEAGNPP TRHLDAFQYI QRANGHFTIH AGEAFGLPSI WEALQWCNAD RLGHGVRIVD
DITVDPDGNA TLGDLADYVR DVRVPLEMCP SSNVHTGAAP SIERHPIGLL RRLHFRVTVN
TDNRLMSGVT LSSEFATLVE TFGYGWSDIR WLTVNAMKSA FLPFDQRLAL INEVIKPGFE
GLVP