Gene Franean1_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3775 
Symbol 
ID5672140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4475144 
End bp4476985 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content72% 
IMG OID641242656 
Producthypothetical protein 
Protein accessionYP_001508076 
Protein GI158315568 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.464348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTTG AGGTCGGCGG GTCGAACGGC AGGCTTACCG CCGGCCCGGT GCGGGCGGCG 
GCGGCCGTGG CGCCGGTCGG CCGGGTGCTC GGTACGGAGG AGAACACGCC TCTGCTGTTC
CACGTCGCCC TGCTGGAGGG CAGCTACCTG CAGCTCGACG ACGTGGTCGT CACGGTCCGC
GCGGTGCCCG GGGTCGGGCC GGTGATGACC GCCGGCATCG TCACCCAGGT GCGGGCCCGG
CACGAGGGCG CGTCGTTCGG CTCGGATGTC TTCCTGATCG CCGACGGCGT CCTGCCCGCG
CAGGTGCAGG AGATCGCCGA GATCACCACG ACCCGGGTGG AGCCGGAGGT GTACGTCCCG
CCGCTGCCCG GTGAGCAGGT ACGGCGGGCG ACCGGTGCGG AGCGGGCGAC GGCGCTGTAC
TTCGACATGA TGGAGAAGCG GATCCCGGTC GGCATCGGCC GTGACGGAAA CCCGATCTAC
ATCAATCTCG AGTTCCTGGA CGGCACCCGG GGCGCGCACG TCTCTATCAG CGGCATCTCG
GGGGTCGCCA CCAAGACCAG CTTCGCGCTC TTCCTGCTGC ACTCGATCTT CCGCTGCGGG
GTGCTGGGGC GGGCCGCGGT GAACACCAAA GCGCTGGTGT TCTCGGTGAA GGGCGAGGAC
CTGCTGTTCC TCGACCACGC CAACTCCCGG CTCGACGAGG ACATGCGCGC CGCGTACCGC
CGGCTCGAGC TGCCCGCCGA GCCGTTCCGC TCGGTCGGCT TCTTCGCCCC GCCGCTGCCC
GACGACACCT CGGGACGTCC GCACGTCGCC TCCCGGCCGA CGGGGGTGGC CGCGTTCTGG
TGGACGATCC AGGAGTTCTG CGCGGGTGAG CTGCTCCCGT ACGTGTTCGC CGACGCCGAG
GACGACCGCA ACCAGTACAC GATCGTCGTC CACCAGGTGG CCCTGCGGCT GCGCACGGAC
GCGGTGCCCG CCGGCGCTGA CGGCGCGGTC AGCCTCGACG GACAGCTGGT GCGGACCTAC
CCCGAGCTGG TCGACCTGAT AGTCGACCGG CTGACCGACG AGGAGACGCG CCGGGACTGG
GCCGGGCCGG TGACGGGTGC CGGGACGGTC AACGCGTTCG TCCGCCGGCT GCGTTCGTCG
TTGCGTTCGC TGCGGTCGCT GATCCGGGCG GATCTGGCGG ACAGCCCGCG CCGGCGGGTG
TCCACGGCCG ACCAGCAGGT GACCGTGGTC GACCTGCACA ACCTGCCGGA GCGGGCGCAG
CGGTTCGTCG TCGGGGTGGT GCTCGCCGCG GAGACGCGGC GCAAGGAGGA GGCCGGTGCG
GGCGGGCTGC TGTTCACGAT GATCGACGAG CTGAACAAGT ACGCCCCGCG GGAGGGCGCG
AGCCCGATCA AGGAGGTGCT GCTCGACATC GCCGAGCGCG GCCGCTCGCT GGGCATCATT
CTGATCGGCG CGCAGCAGAC GGCGAGCGAG GTCGAGCGGC GGATCATCGC CAACAGCTCG
ATCAAGGTGG TCGGCCGCCT CGACTCGGCC GAGGCGGGCC GGCCGGAGTA CGGGTTCCTG
CCGCCCGGGC AGCGCGCCCG GGCCACGCTG GCGAAGCCGG GCACGATGTT CGTCTCCCAG
CCGGAGATCC CGGTGCCGCT CGCCGTCGAG TTCCCGTTCC CCGCCTGGGC GACCCGGCAC
TCCGAGACCG CGGGCCTTGA GACCGCGGGC CTTCCCTCCG AAGCCACCGG CCCGCCGGCC
GGCCCGGTTC CCGGCCAGCC CGGAAGACCG GCCCCCGGCA CGATTCCCCG CAATCCGTTC
GACCTGCTAC CCAACCCCAC CGACGAGGTT CCGCCGTTCT GA
 
Protein sequence
MTVEVGGSNG RLTAGPVRAA AAVAPVGRVL GTEENTPLLF HVALLEGSYL QLDDVVVTVR 
AVPGVGPVMT AGIVTQVRAR HEGASFGSDV FLIADGVLPA QVQEIAEITT TRVEPEVYVP
PLPGEQVRRA TGAERATALY FDMMEKRIPV GIGRDGNPIY INLEFLDGTR GAHVSISGIS
GVATKTSFAL FLLHSIFRCG VLGRAAVNTK ALVFSVKGED LLFLDHANSR LDEDMRAAYR
RLELPAEPFR SVGFFAPPLP DDTSGRPHVA SRPTGVAAFW WTIQEFCAGE LLPYVFADAE
DDRNQYTIVV HQVALRLRTD AVPAGADGAV SLDGQLVRTY PELVDLIVDR LTDEETRRDW
AGPVTGAGTV NAFVRRLRSS LRSLRSLIRA DLADSPRRRV STADQQVTVV DLHNLPERAQ
RFVVGVVLAA ETRRKEEAGA GGLLFTMIDE LNKYAPREGA SPIKEVLLDI AERGRSLGII
LIGAQQTASE VERRIIANSS IKVVGRLDSA EAGRPEYGFL PPGQRARATL AKPGTMFVSQ
PEIPVPLAVE FPFPAWATRH SETAGLETAG LPSEATGPPA GPVPGQPGRP APGTIPRNPF
DLLPNPTDEV PPF