Gene Franean1_6499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6499 
Symbol 
ID5674814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7900977 
End bp7903238 
Gene Length2262 bp 
Protein Length753 aa 
Translation table11 
GC content74% 
IMG OID641245347 
Productglycosyl transferase family protein 
Protein accessionYP_001510742 
Protein GI158318234 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.945206 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.608011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCT CACCGACGCC GTCACCGCCG ACCGACCAGT TCGACCACGC GGCCTACGCG 
AGGGCTGACC GTGCGGCCTA CGCGAGGGCT GACCACGCGG CCTACGCGGG GGCTGACCAC
GCGACCGTTC CGGGCACGCA CTGGCCCGCC GGTGCTCCCG GGCGGGCGGG CGGGGCGTCC
GGGCGGCATC TGGCCGGCGA CGGTCGGATC GGCCGGTTCG CGCGGGGACG TCCCGACGAC
CCGCGGTGGG TGCGGCCCGC GCTGCTCGGG CTGCTCGCGG CGACGGCCGT CCTCTACCTG
TGGGGGCTGG GCGCCTCCGG CTGGGCGAAC GCCTTCTACT CGGCGTCGGT GCAGGCCGGC
TCGGTGAGCT GGAAGGCGAT GTTCTACGCC TCCTCGGACG CCGGGAACTC CATCACCGTC
GACAAGCCCC CGGCCTCGAT CTGGGTGATG GCGCTCTCCG CGCGCGTCTT CGGCGTGAAC
GCCTGGAGCA TCCTCGTCCC GCAGGCGCTG ATGGGCGTCG CCACGGTCGG CCTGCTCTAC
GCCGCGGTGC GCCGGGCCTT CCCCGCCGGC GCGGCGCTGC TCGCCGGGGC CGTCCTCGCG
ATCACCCCGG TCGCGACGCT GATGTTCCGG TTCAACAACC CGGACGCCCT GCTCGTCCTG
CTGCTGGTCG CCGCGGCGTA CGCGACGCTG CGCGCCGTCG AGACCGCGAG TACCCGCTGG
CTGGTCTGGG CCGGGGTGTT CGTCAGCTTC GGCTTCCTGA CGAAAATGCT GCAGGCGCTG
CTGATCGTCC CGGTGCTGGC CCTCGTCTAC CTCGTCACCG CGCCGACGCG GTTCACCCGG
CGGCTGTGGC AGGTCGGAGC GGGCGCGCTC GGGCTGATCG TCCCCTCCGG GATCTTCATC
GCGATCGTCG AGCTCGTCCC CGACTCGGCC CGCCCGTACA TCGGCGGGTC GCAACACAAC
AGCATCCTGG AGCTGACTCT CGGCTACAAC GGCCTGGGCC GGCTGACCGG TAACGAGTCC
GGCAGCGTCG GCGGCGGCGG CGCGGCGGGC GGCGCCGGTG GTGGCGGCAT GTGGGGCTCG
ACCGGGTGGG GCCGGATGTT CGGCTCCGAG GTCGGCGCCC AGATCTCCTG GCTGCTGCCT
ACCGCGCTCG CTCTGCTGGT GGCGGGCCTG TGGATCACCC GGCGCGCTCC CCGCACCGAT
CCCGGACGGG CCGCGCTCGC GGTCTGGGGC GGCTGGCTGC TGGTCACAGG CATCGTGTTC
AGCCAGATGC AGGGCATCTT CCACGCCTAC TACACGGTCG CGCTCGCCCC GGCGGTCGGC
GCCGTGGTCG GCATGGGAGC CGCGACCCTG TGGCGCCGCC GTGAGCACCC GATCGCCGCG
GCCACCATGG CCGGCATCCT GGTACTGACC GCGCTGTGGT CCTACGTCCT GCTCGACCGC
ACACCCGACT GGAACCCGTG GATGCGCTGG GTCGTGCTGA TCGTCGGCTT CGCGGCGGCA
CTGCTGCTCA TCGTGCTCTC GCGGCTGCCC CAGGCGGCCC GCGTGGCGGT CGTCGCGGCC
GCGCTGGTGG CGGCGCTGCT CGGGCCGTTC GGCTACTCGG TCGCGACCGC GGCCACCCCG
CACACCGGCT CCATCCCGTC CGCGGGCCCC GCAGGCTCGG GCTTCGGCGG CCCGGGCGGC
GGACCTGGCG GAGGCGGCGG ACGGCAGCTG TTCGGCGGTC CTGGCGGCGG CAACGGCGGG
GCCCAGCAGG GCACGACGCA GGTCCCCGGC GGCGGCACGG CGCCCGGCGG TACCACGCCG
GGTGGGGCGG CTCAGGGCGG CACGCAGGGC GGGCCCGGCG GCGGCATGGG CGGTGGCATG
GGCGGCCTGC TCGACGCCGG TAAGCCCAGC GATGAGGTCC TCGCGCTGCT GAAGGCGGAC
GCCTCGTCCT ATACCTGGGT GGCGGCGTCC GTCGGGTCGA ACACCGCGGC CGGCTACCAG
CTCGCCAGCG GCGATCCGGT GATGGCGATC GGCGGCTTCA ACGGCAGCGA CCCCTCCCCC
ACCCTTGAGC AGTTCAAGCA GTACGTGGCG GACGGCCGCA TCCACTACTT CATCGGCGGC
GGCGGCTTCG GCGGGCAGAA CGGCGGAAGC CGGGCGTCCA GTGACATCGC CGCCTGGGTG
GCGGCGAACT TCACCGCCAC CACCGTCGAC GGCACAACCC TCTACAACCT GACCACCCCA
ACCACCACCG GCACCACCAC CAGCGGCACC ACCACAGCCT GA
 
Protein sequence
MPTSPTPSPP TDQFDHAAYA RADRAAYARA DHAAYAGADH ATVPGTHWPA GAPGRAGGAS 
GRHLAGDGRI GRFARGRPDD PRWVRPALLG LLAATAVLYL WGLGASGWAN AFYSASVQAG
SVSWKAMFYA SSDAGNSITV DKPPASIWVM ALSARVFGVN AWSILVPQAL MGVATVGLLY
AAVRRAFPAG AALLAGAVLA ITPVATLMFR FNNPDALLVL LLVAAAYATL RAVETASTRW
LVWAGVFVSF GFLTKMLQAL LIVPVLALVY LVTAPTRFTR RLWQVGAGAL GLIVPSGIFI
AIVELVPDSA RPYIGGSQHN SILELTLGYN GLGRLTGNES GSVGGGGAAG GAGGGGMWGS
TGWGRMFGSE VGAQISWLLP TALALLVAGL WITRRAPRTD PGRAALAVWG GWLLVTGIVF
SQMQGIFHAY YTVALAPAVG AVVGMGAATL WRRREHPIAA ATMAGILVLT ALWSYVLLDR
TPDWNPWMRW VVLIVGFAAA LLLIVLSRLP QAARVAVVAA ALVAALLGPF GYSVATAATP
HTGSIPSAGP AGSGFGGPGG GPGGGGGRQL FGGPGGGNGG AQQGTTQVPG GGTAPGGTTP
GGAAQGGTQG GPGGGMGGGM GGLLDAGKPS DEVLALLKAD ASSYTWVAAS VGSNTAAGYQ
LASGDPVMAI GGFNGSDPSP TLEQFKQYVA DGRIHYFIGG GGFGGQNGGS RASSDIAAWV
AANFTATTVD GTTLYNLTTP TTTGTTTSGT TTA