Gene Franean1_5438 details

Gene Information

Locus tag: Franean1_5438
Symbol:
ID: 5673769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6579199
End bp: 6581511
Gene Length: 2313 bp
Protein Length: 770 aa
Translation table: 11
GC content: 69%
IMG OID: 641244293
Product: hypothetical protein
Protein accession: YP_001509699
Protein GI: 158317191
COG category:
COG ID:
TIGRFAM ID:
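
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields above.
START_BP = 6579199
END_BP = 6581511
GENE_LENGTH_BP = 2313
PROTEIN_LENGTH_AA = 770


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields of the record.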


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.180108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0529249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCACG GGCCCATTAC GCATGTGCCG TGGACAGATG TAACGTTCAT CTGTCCACGG 
CATCCCCAGG GCATGTGCAG AGACCTCCTG GGGACGTCGA CTACTGCCGA TTCGGGGGAA
CAGGCAGGTG ACCCGGTGAT CGCATCACAG GCGAACGCAC CCAGCGAAGA ACTCGTCGCC
AGAATTTTCG GTCCGAACGA ACGCTGGGGC TGGGAATACG TGCCCCCGCA TCCCGCGACG
GCGCACCACG AGGCCTTCCA GCCGCCGTCG CGGATCCAGA TTCCGGTGCC CGACCTGCGG
GCGCTCGAGT TCCGCAAGCG ACAGTTGTCA GGGAAGATCT GGCGGCCGAT CGTCGGCGGT
CTGTTCCTGC TCGGCGGGCT CGGGACGCTT GCCGGATCAG TAGGCGGCCT CGGTCCTCTC
CTCATCGGCG TCGTCCTGCT CGCCTGGTAT TTTGTACCGA TTCAGACGGT CAGCAATCAA
ATGAAGTCCA TCCAGGCGGG CTACCAGGCG GAGATCGCCC GCCGCGAGCA GGACTACCAG
GTCGCCTATG CCGCCTGGCA GCAACGCATC CACCAGCACG ACCAGGCCGA GCAATACCGG
GTCGGATCGG CGCTGGAGTT CTATCCGTTG GATCCGCAGC GCCCGGCGCG GATCGACGTC
TTCGGTGGCA CCGGCGCCGG CTGGACAAGC CTGCTGGCCA CCGGCGGGAC GTCGCTGCTC
GCGGGCGGGT CCGGAATTCT GCTGCTCGAC CTGTCGGAGC TCAGCGTCGG CGCCGGCCTG
GTCATGCTGG CCAACAACGC GGCCACGCCG ATCTCGGTCG ACGTCCGGGA GCTGCCCGGC
TCGCTGGAAC GGATCGGCCT GCTCGGGGAG CTCGACCTGC GGCAGGTCGC CGAGCTGCTC
GCCGACGCCT TCGACGCGGA CCGCCGGGGC GGGGGGGACC AGAGCCTGCG CGCGATCGAC
TTCAACATCC TGCGCAGCGT TGCCTCCTCG ATTGAGGCGC CGCTGACCTT CGCCCGGCTC
GCCGCCGCGT TGCGCATCCT CGACAACCAG AGTTCGGCCG TCGGCGAGGG CGTGTTCAGC
GACTACGAGG TCCAGGCGCT GCAGCAGCGA ATGTATGACC TCGGCCAGCG GGAGCGCACG
GCCGACCAGA TCAGCTTCCT GCGCACGGAG TTGGAGACGC TCGCCGGCTC CGACCCCACG
GCGGCCGATG TCCCGCCGGC CGCACCGTCG GCCTGGTGGC CGGGCGGCGG GCTGCGGGTG
CTGGCGACGA CAAGCTCGGG GCGCGGCAGC TCCAAGCGGC GCAAGCTGCT CACCGACCGG
ATTCTGGTCG AGCGGTTGCT GCACCAGCTG CGCAGTCACG AGCGCGCCGC TTCCAACGAC
GTCGTCGTGA TCGCCGGCGC CGACCACCTC GGCCGGGAAA CACTGACGAC GCTGACCCGG
CAGGCGGAGG TCGCCCACGT CCGGCTCGTC CTGCTGTTCA AGAACCTCAG TGACGACGCG
GAAAGGCTGA TCGGCACCGG CGACAGCGCG GCGATCTTCA TGCGGCTGGG TAACGCCCGG
GAGGCCTCAA GCGCGGCCGA TCACATCGGT AAGGGCTTCA GTTTCGTGCT CTCCCAGGTC
ACCAACCAGA TCGGCGACAG CTTCACCGAA GGGTTCGCCA ACAGCTACGG CGAGCAGGAC
GGCACCGCGT TCACCCGGGG AGAGGGCCGC ACCAGCGGTT CCGGCCCCGG CGGCGGAAGC
AGCGGCCGGA ACTGGAACCA GTCGAGCACG ACGTCGCATT CGTCGACGTG GACCAACACC
GTCAACGTCT CCAACACGGT GAGCCGGAAC CTCGGCACGA CTCTGCAGCG GGAGAAGGAC
TACACCGTCG AGCCGACCGT GCTGCAGAGC CTGGCCGCGA CCGCGTTCAT CCTGGTGGGC
ACGTCCAGCG GTTCCGGTCG GGTCCGCCCC GGCGACTGCA ACCCAGGTCT TTTGCTGCTG
CCCAAGGTCG CGGACAGTCC CCGGGACCTC ACCGCAGCGC CGCACACCCA CGAGGCGGGG
GACCCGAACC AGGGCGGCGC CACCGCCACG CACCCGGGCC CGGTGTCACA GCGGGAGTTC
CAACATCAGC TGCCGCCGGG GTACTCGCAC CCTCAGTCGG GCTACCCGCA GCAGCAGCCG
GGCTACCAGC ACACTCAGCC GGCCTATCCG CAGCAGCAGC CGGGCTACCC ACCGTCGGGC
TACCCACCGT CGGGCTACCC GCCGTCGGGT TACCCGCAGC AGCAGCCCGG CTATCCGCAA
CAGCCGCCAC ACACCTGGCC GGGACAGCAG TAG
 
Protein sequence
MLHGPITHVP WTDVTFICPR HPQGMCRDLL GTSTTADSGE QAGDPVIASQ ANAPSEELVA 
RIFGPNERWG WEYVPPHPAT AHHEAFQPPS RIQIPVPDLR ALEFRKRQLS GKIWRPIVGG
LFLLGGLGTL AGSVGGLGPL LIGVVLLAWY FVPIQTVSNQ MKSIQAGYQA EIARREQDYQ
VAYAAWQQRI HQHDQAEQYR VGSALEFYPL DPQRPARIDV FGGTGAGWTS LLATGGTSLL
AGGSGILLLD LSELSVGAGL VMLANNAATP ISVDVRELPG SLERIGLLGE LDLRQVAELL
ADAFDADRRG GGDQSLRAID FNILRSVASS IEAPLTFARL AAALRILDNQ SSAVGEGVFS
DYEVQALQQR MYDLGQRERT ADQISFLRTE LETLAGSDPT AADVPPAAPS AWWPGGGLRV
LATTSSGRGS SKRRKLLTDR ILVERLLHQL RSHERAASND VVVIAGADHL GRETLTTLTR
QAEVAHVRLV LLFKNLSDDA ERLIGTGDSA AIFMRLGNAR EASSAADHIG KGFSFVLSQV
TNQIGDSFTE GFANSYGEQD GTAFTRGEGR TSGSGPGGGS SGRNWNQSST TSHSSTWTNT
VNVSNTVSRN LGTTLQREKD YTVEPTVLQS LAATAFILVG TSSGSGRVRP GDCNPGLLLL
PKVADSPRDL TAAPHTHEAG DPNQGGATAT HPGPVSQREF QHQLPPGYSH PQSGYPQQQP
GYQHTQPAYP QQQPGYPPSG YPPSGYPPSG YPQQQPGYPQ QPPHTWPGQQ