Gene Francci3_1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1602 
Symbol 
ID3903737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1922655 
End bp1924391 
Gene Length1737 bp 
Protein Length578 aa 
Translation table11 
GC content70% 
IMG OID637878939 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_480707 
Protein GI86740307 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.484756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGCACG CGACATTGCT GCCGGAAGTC CTACAGAACC GCGCCGCCCG CCAGCCGGCC 
AGGCGGGCCT ACGTGTTCGT GGATGAACAC GAGGCGGAAA AGGCGGTACT GACGTACGGC
GACCTGCACG CGCGGGCGCT CGCCGTGGCC GGGGAGCTGA TCCGGCGCTG CCGGCCCGGC
GACCGGGCGC TGCTGCTCTT CCCGCCGGGT CTGGATTTCA TCGTCGCCTA CTTCGGCTGC
CTCTACGCGC AGGTGATCGC GGTCCCCGTC AACCCCCCGC GCAGGAACCT GATCCAGGAC
GCGACCCGGA GCATCATCAA GGACTGCGAG CCTTCGGCCG TGCTCACCGT CGGCGCGATG
GTCGAGCACA TCAGGCCCGT CGTGGAATCG ATCCGTGGCC CCCTCCCCTG GCTGCCGGTC
GACCAGGTGG CGGACGAGAC GAACGAGACG GACCAGGCGG GCACGAGCTT CCGCCCCCGG
CCCTGTCCGC CGGATTCCGT CGCCTTCCTT CAGTACACCT CCGGTTCCAC GTCCGATCCG
AAGGGGGTCA TGGTCTCCCA CCGGAACCTC GCCGCGAACC AGGAGATGAT CCGGCGCGCG
TTCGACCACG ATCAGGACTC GACGTTCGTC GGCTGGGCAC CGTTCTTCCA TGACCAGGGG
CTGATCGGCA ACATCCTGCA GCCGCTCTAC CTCGGGGCGA CCAGCATCCT CATGGCGCCG
ATGACGTTCA TCCGGTGGCC CCTGCGCTGG CTGTCGGCCA TCTCCCGGTA CCGGGCCCAC
ACCAGCGGCG GGCCCAACTT CGCCTTCGAT GTCTGCGTCG CACGGGCCGC CCGGGGGGAT
GTGCCGGACC TCGACCTCAG CTGTTGGAAG GTCGCGTTCA ACGGGGCCGA GCCCATCCGT
CACGAGACCC TGCGCCGGTT CTCGGCGATC TTCGCGCCCC ACGGGTTCGA CGAGAAGGCG
TTCTACCCGT GCTACGGCCT GGCCGAGGCG ACCCTGCTCG TGACCGGCAG CCGGAAGGGC
CGCGGTCCCC GCGCCCTCGA GGCGGACGTC GAGGCGCTCG GTCACCGGCG CTATGTGCCG
GCATCGGGCG GACGCGGCCG GAGTCTCGTC GGATCCGGGC TCGTCCTCCC GGAGGAGGAG
CTCCGGATAG TGGACCCCGA AACGGGACGC CCGTGCCCCG CGGACGAGGT GGGCGAGATC
TGGGTCTCCG GCGACCAGGT GGCGCAGGGA TACTGGCGCC GCCCGGAGGC GACGGCCGAG
GTGTTCCACG CCGAGTTCGA CGGCGAGACC GGCCGGGCTT ACCTGCGCAC CGGCGATCTC
GGCCTGCTGG TCGACGGCGA GGTCTACGTC GTGGGCAGGC TGAAGGACCT GGTGATCATT
CGGGGCCGGA ACTACTATCC CCACGACATC GAGCTCACCG TCCAGTCGGC CCACCCCGCG
TTGCGCCCCG GCGGGTGCGC CGCGTTCTCG GTTCCCGGTG CCGACAGCGA GAAGCTGGTC
GTCGTGCAGG AGATCAGGGA CGAGCAGCGC CTCACCGCCG ACGCGAGGGA CGTCGCTGCG
TCGATCCGGG CGGCGGTGAC GCGGGAACAC GACCTCTCGG TGAACGACCT CGTGCTGGCC
CTGCCGGGCC GGCTACAGAA GACCAGCAGC GGCAAGATCA TGCGAGCCGC GGCCAGGAAC
CGCTACCTGG CGGCCGGGTT CGAGATCTGG GAACCGGGGA TGTCCTCCGT CGCCTGA
 
Protein sequence
MPHATLLPEV LQNRAARQPA RRAYVFVDEH EAEKAVLTYG DLHARALAVA GELIRRCRPG 
DRALLLFPPG LDFIVAYFGC LYAQVIAVPV NPPRRNLIQD ATRSIIKDCE PSAVLTVGAM
VEHIRPVVES IRGPLPWLPV DQVADETNET DQAGTSFRPR PCPPDSVAFL QYTSGSTSDP
KGVMVSHRNL AANQEMIRRA FDHDQDSTFV GWAPFFHDQG LIGNILQPLY LGATSILMAP
MTFIRWPLRW LSAISRYRAH TSGGPNFAFD VCVARAARGD VPDLDLSCWK VAFNGAEPIR
HETLRRFSAI FAPHGFDEKA FYPCYGLAEA TLLVTGSRKG RGPRALEADV EALGHRRYVP
ASGGRGRSLV GSGLVLPEEE LRIVDPETGR PCPADEVGEI WVSGDQVAQG YWRRPEATAE
VFHAEFDGET GRAYLRTGDL GLLVDGEVYV VGRLKDLVII RGRNYYPHDI ELTVQSAHPA
LRPGGCAAFS VPGADSEKLV VVQEIRDEQR LTADARDVAA SIRAAVTREH DLSVNDLVLA
LPGRLQKTSS GKIMRAAARN RYLAAGFEIW EPGMSSVA