Gene Franean1_6887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6887 
Symbol 
ID5675200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8390780 
End bp8392354 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content72% 
IMG OID641245736 
Producthypothetical protein 
Protein accessionYP_001511127 
Protein GI158318619 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAGGT CCTTTTCCGA ATCCGAGCTA GTTGAACTGG CGACGGTCTA CCACGAGAAG 
CGTTCCGCCA GATCACTGCT GGAGCGCGCG GGTATCGACC CCGGAAGCCA TCCCAGCTGG
GACAGCCTCA CCTCCCTGGA CTTCTGGCGG GCGGTGGGTG CGCAGGTCAC GTCGGGCTGG
GGCGGGTCGG ACACCGCCGA CCGAATCCTG GCCGCCGCGG CCAGGGATTA TGTCGGCAAC
GACGTCTTCC GCCGGCCGGC CGGCGGCCCG CCGCACGGCT CCCCGGTCCC GGCTTCGGCC
CCAGCCCCGG TTTCGGCCGC GGCCGGGAGC GCCGCCGCTG TGTCGGGCGC CTCCGCCGGG
GGTGGCCGGG CCGCCGAGCC CTTCGGGAGA TCCGGCCCGC CCGGCTCACC CGATCCGCTC
TCCGTCCTGG GGACGATTGT CGTGGTCGAC GCGGTGGGGT TCAGCAAGAA CGGGGCGCTC
GTCCATCTGG AGTGGCGCAA GGGCATCAGC GCCATCACGG CACGGGCTGC CGCTGCCGTC
GGTATCCCGC CAGGTCTCAT GCACTTCAAC GACCGCGGCG ACGGATTCAT GCTGATCATC
GACGGCCGGG TACCCCCTGA GACGGTGGTC GCCGACTTCA CCCGGGAGCT GGGGATCGCC
CTCGGCGAGT ACAACCGCAC ACGCAACAGC GCCGGCCGGA TCCGGCTGCG CATCGCGATG
CACGAGGGCC GGGCGTTCGT GGACGGCACG GGCCTCACCG GCACGCCCGC GATCGTCGCG
GCGAGACTGG TCGACGCCGA GGAGTTGCGC GACGTCCTCC GGAGGTCCGA CGGCCCGGAC
AGCGCCCTGA TCGTCTCGGA TGTGCTCTAC CAGTCCACGG TCATCGAGCG GCTGCGCGGG
CTCGACCCGG AGGACTTCGT CCGGGTCGCG GTCACGATGG CGAAGTACGA CGGTGTCGCG
TGGATCGCGG GACGCGAGAA CGCCGACGCT GCTCAGCCGG GCGGTTCGGG CGAAACCGCC
CGGACGGCGC CCGAGTCGCT ATCCGCACCC TCGCCGGCGA CCGGCAGGTG GGACTTCCTG
ATCTCGTGCA CGGCCGCCGA CGAGGACTGG GGCCAGTGGA TCGCCCATCA GCTCAAGACC
GCGAAGTACG AGGTGCACCT GGACGCCTTC GACATGGTGG CGGGCAGGGG ACGCGTCGCG
GAGTGGCACG ACGCGGTCCG GTACTCGACC CGGACGATCG CGGTGCTGAG CGAGAACTAC
CTGACCGCGG CCAGGGAGAT CCAGGCCCAG TGGCAGGCGG CCTGGGAGGG CGATGGCACC
GGCGCCGGGA ACAACTCGGC CGGCGACGGA GGCGGCAAGG GCGGGGAAGG CGGGGTCGAG
CGTCGGTTGA TCCCGGTGGT GGTCCGGCCG TGCTACCCGG ACGGCCTGCT CAAGGGCATC
ACGCCGATCG ACCTGACCAG ACTGCGGGAC GACCGGGATG CCGCCCGCGA CCATCTTCTG
GAGGAGGTCG CGGCCTCGCT GAGCGGGCGG CGCAGGCGCT CGGAGGACCC GCCCCCGTTC
CCGGGAGAAA GCTGA
 
Protein sequence
MARSFSESEL VELATVYHEK RSARSLLERA GIDPGSHPSW DSLTSLDFWR AVGAQVTSGW 
GGSDTADRIL AAAARDYVGN DVFRRPAGGP PHGSPVPASA PAPVSAAAGS AAAVSGASAG
GGRAAEPFGR SGPPGSPDPL SVLGTIVVVD AVGFSKNGAL VHLEWRKGIS AITARAAAAV
GIPPGLMHFN DRGDGFMLII DGRVPPETVV ADFTRELGIA LGEYNRTRNS AGRIRLRIAM
HEGRAFVDGT GLTGTPAIVA ARLVDAEELR DVLRRSDGPD SALIVSDVLY QSTVIERLRG
LDPEDFVRVA VTMAKYDGVA WIAGRENADA AQPGGSGETA RTAPESLSAP SPATGRWDFL
ISCTAADEDW GQWIAHQLKT AKYEVHLDAF DMVAGRGRVA EWHDAVRYST RTIAVLSENY
LTAAREIQAQ WQAAWEGDGT GAGNNSAGDG GGKGGEGGVE RRLIPVVVRP CYPDGLLKGI
TPIDLTRLRD DRDAARDHLL EEVAASLSGR RRRSEDPPPF PGES