Gene Franean1_6357 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6357 
Symbol 
ID5674673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7716326 
End bp7718473 
Gene Length2148 bp 
Protein Length715 aa 
Translation table11 
GC content71% 
IMG OID641245206 
ProductTIR protein 
Protein accessionYP_001510601 
Protein GI158318093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGAAC ACTGCGATGC GGACGGCCAT GCCGGTTACG ACTTCTGCGT GTCGTATGCG 
CTGGCGGACA CGGCCTGGGC GACGTGGATT GCCTGGACGC TGGAGGAGAC CGGGTACCGG
GTTCAGATCC GAGCCTGGGA TTCCGTTCCC GGATCCCACT GGGTTGCGGC CATCGACGAC
GGCCTGCAAC GGTCCGACCG GACGATCGCG GTGCTGTCCG ACAACTACCT TCGTTCAGAG
CTGCACCAAG CGGAATGGCG GGCTGCCTGG GCCGACGACC CGACGGGCCG GAGCCGACGG
CTTCTCGTCG CCCGGGTCAC GGACTGCGCG CCGCCGGGGC TGCTCGGCTC GGTCGTCCCG
GTCGACCTAT TCGGCCGGCC GCGGGATGTC GCTCGTTTTC AGCTGCTGTG GGCGGCGGGG
CTGGCGGTCT CGGGCGGACG GGGAAAGCCC ACTTCCGCGC CGCCGTACCC GCAGGACGAC
AGGGTGGTGT CCGTCCAGCC GTCGTTTCCC GGTCCGGAGG CGCCGGATTC CGCAGGGCAC
GGCGCCATGT TCTTCTCCTG TCCCGAGCCG CGCCTTTCCC TTGCCGGCGC GCCGAGTGTG
CTGCTGCGGC CCGAGCACAG TGTCGCGCCG TTCGTCGAGC TCGACGGCGA CCGGGCGGCG
CTGCGGCTGT GGGCGGCCGG ATCGGACCCG GTCTCCGTGC GGCTGCTGAA GGGTGCGGCC
GGCTCGGGCA AGACCCGGCT GGCGATGCGG GTCTGCGCGG AGCTGAGCGA GCGGGGCTGG
CTGACGGTGG TGGTCAACTC GTCGTTGAGC CCAGCCCAGC TTCAGTGCCT GTGCGAGTCC
GACGTCCCGG TGCTGGTGGT CGTCGACAGC GCCGAGCTCC GTCTCGACCA GGCCGTTCTG
GTCGCCGCCG CACTTGTGGA GGGGACCGCG CCTGGTGCCT CGCCGCGGCG GCTGCTCCTC
CTCACCCGGT CCACCGAGCT GTTGCTGGAG GATCTGTGGT CGCTGGCCGA TAGCCGGGCC
GACGCTGGGG CCGCCGATCT GTTCCGGGCG GCCCGGGTTC AGGCGAAGAC GCCGCTTCCG
GCCGGGGCCG ACGCGCGACA CGTCCACTTT GAGGCTGCCT ACACGGCGTT CCGTGTCGAG
CTGCGGCAGA TCGGGCCGAT CGCCGCGCCT CCGGCTGATC TCGTCGACCT TCACTCGGTG
CTGGACGTCC ACGCGGCGGC GCTGAACGCG TGTCTCGACA GCACCCGCGG CTCCCGGGCC
CCGCTTGCCA CGCTCCTGGA TCATGAGCGG CGCCGCTGGC CCGGAGGCGA TCGTGGCGGC
GGCCGTCCTC GGCCGGGTCT GACCGACAGG ATCTGCGCGC TGGCCACACT CCTGCGGCCT
GGCTCCTTCC TGTCGGCGGA GGCGTTGTCG GCCCAGCTGC CCGGGATCCT CGGTGTCGAC
GCCGAGCGGA TTCGGCGGTG TCAGGACGAG CTCGGTGCGC TCTATCCGGG TCGATGGAGG
TTCGATCAGA TCCGGCCCGA CGCATACGGG GAACACGTGG CCGCCGGCAC TCTGGTAGCC
GACCCTTCGC TTGCTCGTGC CCTGGGCCGT CTCTGTTCTG CCGAGCAGGC GGCGGACGTC
CTCGCGGTGC TCGGCTGCGC GCTGCCCCGC CATCCGGAGC TGGCCACGAC GATCGTGGAG
ATGCTGCGGA CGAGGCCCGA CGAACTGGTT CTCGTGGCGG CCGACGTCGC GACCCGCCTG
CCTGACCCGG AACAGTTCGC ACGGACCACC GCCGACGCGT TTGACGACGC GACCATCCGC
CAGCTCACGC TCGTCACGGC GACGGCGCTG GCCGAGCGGG TACGGCGCGG CGGCCGGGAA
CTCGGCCCGC TGCGTGGCCT GACCCTGCGT GCGTCGGCCG CGGGCGGACG CAGGCTGACC
GCGGGTTTCA CCGAACCGGC GGCGCCGGAT CCCACAACGG CCGGCCTGGC CAGGGTTGCC
GAACGAGTGA CCGACGGCAT TGTCGATCTT CTTGTGGGCC TTGTCGACCC AGGTTCCGGG
CGCATGCCTC CGCAGGCTGA CGGAAGTCCG AACATCGCAC CAGATCTCCT TGGCACGCTT
TTTGACCTCC AACGACATTT CCGGAGGTCG GATCCCGAGA CGAAGTGA
 
Protein sequence
MTEHCDADGH AGYDFCVSYA LADTAWATWI AWTLEETGYR VQIRAWDSVP GSHWVAAIDD 
GLQRSDRTIA VLSDNYLRSE LHQAEWRAAW ADDPTGRSRR LLVARVTDCA PPGLLGSVVP
VDLFGRPRDV ARFQLLWAAG LAVSGGRGKP TSAPPYPQDD RVVSVQPSFP GPEAPDSAGH
GAMFFSCPEP RLSLAGAPSV LLRPEHSVAP FVELDGDRAA LRLWAAGSDP VSVRLLKGAA
GSGKTRLAMR VCAELSERGW LTVVVNSSLS PAQLQCLCES DVPVLVVVDS AELRLDQAVL
VAAALVEGTA PGASPRRLLL LTRSTELLLE DLWSLADSRA DAGAADLFRA ARVQAKTPLP
AGADARHVHF EAAYTAFRVE LRQIGPIAAP PADLVDLHSV LDVHAAALNA CLDSTRGSRA
PLATLLDHER RRWPGGDRGG GRPRPGLTDR ICALATLLRP GSFLSAEALS AQLPGILGVD
AERIRRCQDE LGALYPGRWR FDQIRPDAYG EHVAAGTLVA DPSLARALGR LCSAEQAADV
LAVLGCALPR HPELATTIVE MLRTRPDELV LVAADVATRL PDPEQFARTT ADAFDDATIR
QLTLVTATAL AERVRRGGRE LGPLRGLTLR ASAAGGRRLT AGFTEPAAPD PTTAGLARVA
ERVTDGIVDL LVGLVDPGSG RMPPQADGSP NIAPDLLGTL FDLQRHFRRS DPETK