Gene Franean1_4824 details


Gene Information

Locus tag: Franean1_4824
Symbol:
ID: 5673165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5760202
End bp: 5762190
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 68%
IMG OID: 641243680
Product: NUDIX hydrolase
Protein accession: YP_001509096
Protein GI: 158316588
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0775] Nucleoside phosphorylase
        [COG1051] ADP-ribose pyrophosphatase
TIGRFAM ID:
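
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(5760202, 5762190)
    print(length, "bp")                   # Gene Length field
    print(protein_length(length), "aa")   # Protein Length field
```

Under these assumptions the values reproduce the Gene Length (1989 bp) and Protein Length (662 aa) fields listed above.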


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0491634
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTCCG CCAGGGGGGC GTCTTTTTCT GTCAGCGTGA CAAGAAGCTA TTCTCGCCGC 
GTGGACGATG ATCCGTGGGA ACCTCCCGCC GTTATGGTCG CCGTCGACCT GGTGGTTCTT
ACATTGCGTG GGGCGAGCCT GCGGGTACTT CTCGTTGAAC GGGGAGTCGA GCCGTACCGA
AATTTCCTGG CGCTACCCGG AGGTTTTCTC GCCCATGCGG AGGAGGATCT GACCTCCGCG
GCCCGGCGCG AACTCTCACA GGAAACGGGG GTCGATCCGC GGCTGCTGCA TCTCGAACAG
TTCGGCGTCT ACGGAGAGCC GGGCAGGGAT CCGCGGGGCC GGGTCGTCTC GGTGGCCTAT
CTCGCGATCG AGCCACAGCT GCCCGAGCCA GTCGCCGGTA CGGACGCGAT CGGGGCGGGC
TGGCATCCGG TCGACCGGGT GCTCGCCGGG GACGTCACGC TCGCGTTCGA CCATCTCCAG
GTTGTCGCGG ACGGAGTCGA ACGAGCCCGT TCGAAGATTG AGCACTCGAC GCTGGCGACG
GCATTCTGCG CGCCCATCTT CACGATCGCG GAACTGAAGG ATGTATACGA GGCGGTGTGG
GGAGTACCCG TGGACCCACG GAACTTCTAC CGCAAGGTCC AGAAGACCGG CGGATTCATA
GTCTCCGCCG GCGCGACACG GCGGACGGCA GGCGGGCGCC CGGCCCGGCT CTTCAGAGCC
GGCCCGAGCG CTGTGCTCTC GCCGCCGCTG TCCCGACCGG CGGTCTCTCC GGGGAGTCCT
GGAACGGCCA GCGCAATGCA CACAGAGGGG GGAACCCGAA TGAGTCGGCG ACCAGTCGTG
ATTCTCACGG CACTCGATCT GGAATATCAA GCGATCCGTG AGAGCCTTGT CGACCCGCGC
CTGCACCGCC ATGACCAGGG CACTCGATTC GAACTCGGAC GGCTGGCCGG AGGCAGCTGC
CGGATCGCCC TCGCCCATGT CGGTAAGGGC ACCCATCCGG CCGCCGTGCT GGCCGAGCGC
GCGATCGCCG AGTTCGCCCC GGCGGCGCTG CTCTTCGTGG GGGTCGCGGG TGCCCTGCAC
GGACACATAG CCCTGGGCGA CGTCGTTGTG GCGACCCACG TGTACGCGTT CCACGGCGGT
ACCGGCGAGG ACGACGGGTT GAAGGCACGG CCCCGTGTCT GGGAGACCTC GCACGCGGCC
GACCAGATCG CTCGGCACAT TCACCGGACG GGTTCCTGGG CGCAGCGGCC AGGAGCTCCC
GCCATGCTGC CCGACGTGCA CTTCGGACCG ATCGCCGCCG GCGAGGTCGT GCTGAACTCC
AGGGTCTCCG GACTCGCCCG CTGGATACGT GAGCACTACA ACGACGCCCT GGCGATCGAG
ATGGAGGGGG CCGGCGTCGC GCAGGCCGGG CACCTGAACC GGGCGCTGCC GGTCGTCGTG
ATCCGTGGTG TCAGCGACCG CGCCGACGGG ACCAAGGAGT CCACGGACCG CCAGCGGTGG
CAGCAGCGCG CGGTCGCCAA CGCGGCACTG TTCGCGACGG CACTGGCCGA GGAGATTTCC
GCAGAGGACG ACGGCGCCGG ACGCCCGGCA GCGAAATCAA CGAAAGGAGC CGCAGTGCGC
GAGTACGTTC AGAGCGTTCA CAACGAGAAC TCCGGCAGCG GGCCGGTGGG TGTACAGGCC
CACACCGTGC ACGGCGGTGT CGTGCACGTC ACCGGCGGCG CTCGCCCGCC GGTCGACCTC
CCGATGGCCC TCGCGGAGAT ACGCACGCGG TTCCAAGCAG CCCGGGCGGC CGGTGCGGTG
GACGAGGACA CCTACGCCGC CGCGGAGGCG GAGCTCGCGG TGGCCGAGGA GGCCCTCAAA
GCGGACACGC CACAGAGCCA CAGCACGTTG CGGGTGGCGC TGAAGAAGCT CAAGGGGCTT
GTCGGTGACG TGTCCGATCT CGCCGCGAAG ATCACGATTG TCCTGGCGCT TGCCCGGAAT
TTGCCATGA
 
Protein sequence
MTSARGASFS VSVTRSYSRR VDDDPWEPPA VMVAVDLVVL TLRGASLRVL LVERGVEPYR 
NFLALPGGFL AHAEEDLTSA ARRELSQETG VDPRLLHLEQ FGVYGEPGRD PRGRVVSVAY
LAIEPQLPEP VAGTDAIGAG WHPVDRVLAG DVTLAFDHLQ VVADGVERAR SKIEHSTLAT
AFCAPIFTIA ELKDVYEAVW GVPVDPRNFY RKVQKTGGFI VSAGATRRTA GGRPARLFRA
GPSAVLSPPL SRPAVSPGSP GTASAMHTEG GTRMSRRPVV ILTALDLEYQ AIRESLVDPR
LHRHDQGTRF ELGRLAGGSC RIALAHVGKG THPAAVLAER AIAEFAPAAL LFVGVAGALH
GHIALGDVVV ATHVYAFHGG TGEDDGLKAR PRVWETSHAA DQIARHIHRT GSWAQRPGAP
AMLPDVHFGP IAAGEVVLNS RVSGLARWIR EHYNDALAIE MEGAGVAQAG HLNRALPVVV
IRGVSDRADG TKESTDRQRW QQRAVANAAL FATALAEEIS AEDDGAGRPA AKSTKGAAVR
EYVQSVHNEN SGSGPVGVQA HTVHGGVVHV TGGARPPVDL PMALAEIRTR FQAARAAGAV
DEDTYAAAEA ELAVAEEALK ADTPQSHSTL RVALKKLKGL VGDVSDLAAK ITIVLALARN
LP
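
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record: it assumes translation table 11 shares the standard codon-to-amino-acid assignments and treats an alternative start codon such as GTG as methionine when it is the first codon. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for table 11; the first codon is read as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":  # stop codon ends translation
            break
        protein.append(amino)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("GTGACGTCCG CCAGGGGGGC GTCTTTTTCT"))
```

Applied to the first 30 bases of the gene sequence, the routine yields the first ten residues of the protein sequence, MTSARGASFS.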