Gene Francci3_4204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4204 
Symbol 
ID3907169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5019607 
End bp5021232 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content67% 
IMG OID637881532 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_483281 
Protein GI86742881 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGTT ACACCGGCGC CTTCTCTTTG GAAATCCCGG ATTTATGGAA AAACCATGAG 
AACTGCGCTT TCCGTATTCT TTACGGACTG GTCCAGAGGC CGAACTCGCT CGTGGTGCGC
TGGCGTGACA GAGAAATCAC GGCCGAGCTT CTCGCCGAGC TGATCCTGAA TGCTGCAGAA
ACCTTCCTGC AATGCGGTGC CGGCCCACGC GAAGCCATTG CCGTCCTTGC ATCGTCCAAT
CATCCGGCGA TGCTCGTCTG CCGATATGCG GCACACCTCA TCGGGGCATC CGTCGTCTAC
GTCCGGGCCG CCAACCCCCG CACCGACGCC GAAATGCTGT CGCGCCAGGT GCAGTCCCGG
ATCCTCGAGG AGGTCGGGGC CCGCGTCCTG GTCGTCGACG CATCACACAT CGACCGGGGA
CGAGCTCTCG TGGCGTCCGC CACGACGCTG CTCACCCTCG TGACGGAAAC GTACCCCGCG
GTGCCGCTCG ACACAGGTCG TACGCCCCGG CTGCCTGATC TTCCCCCGTA CGACGGCGAC
GCGCGTGCTC TGGTCACGTT CACCAGCGGA AGCACGGGAC AACCCAAGAG CTTGTCTCAG
TCGTACCGCA CCTGGAACGC GACCGTCCGC GGGTTCTCCG GCCGGACCGA TACCCACCTA
CCGTCGCGGA TCCTGGCCGT GACGCCGGTC AGCCACACTG TCGGCTTCAT GGTCGACTCC
GTCCTCGCCG CCGGTGGCAG CGCGGTCCTG CATGAAGGCT TTGACGCCGG CACCGTGCTC
AGCGATGTCG CCAGGCACCG GATCACCGAT ACCTATCTGG CCGTTCCGCA CCTGTACCGC
CTGGTTGAGC ATGAGGACCT TCCCCGCACC GATGTGTCGT CCCTGCGTCG GCTCATCTAC
AGCGGCACCC CGGCCGCGCC ACGTCGGATC GCGCAGGCGG TCCCCTGCTT CCGCGACGCC
ATCGTCCAGC TCTACGGCAC GACGGAAGCG GGCGGCATCT CCAGCCTCAC GCCGCTGGAC
CACCAGGAGC CCGAACTCCT GCCGACGGTC GGGCGGCCCT TCCCCTGGGT GCAGGTCCGC
ATGTGTGACC CGGACACCGG CGCTGAGGTG GAGCGAGGCC ACGTGGGGGA GGTGTGGACG
TACTCGACAA CAGTGATGGA CGGCTACCTG GAGACCGGCG TCCCCACGCA CAGCACTCTG
CGAGACGGCT GGCTGCGCAC GGGTGACCTC GGCTACTGGG ACCAGTACGG CTATCTGCGG
CTGGTCGGCC GGGTCGGCCA GGTGATCAAG GCCGGCGGAC AGAAGGTTTA CCCGACGGCC
GTCGAGTCGG CGCTCCAGGA GCATCCCGAC GTGCGGCATG CCGTTGTCTT CGGCGTCCAC
GACCGGGACC GGATCGAGCA CGTGCACGCC GCAGTCGTCC TGGCACCCGG TTCCTCGGTC
ACAGACGAGG AGCTGAGTCG CCATGTGGCC GCCACGCTCG ACTCGGCCCA TGCACCTGCG
CACTTCAGCC GATGGGCCGA GATCCCGCTC ACCGCGTACG GGAAACCGGA CCGAGCGTCA
CTGCGTTCCC GAGCGGAGCG GGAAGCGCTG GGCGGCGGGA CAGTGCAGAG AGGAAGGGCA
CTGTGA
 
Protein sequence
MTSYTGAFSL EIPDLWKNHE NCAFRILYGL VQRPNSLVVR WRDREITAEL LAELILNAAE 
TFLQCGAGPR EAIAVLASSN HPAMLVCRYA AHLIGASVVY VRAANPRTDA EMLSRQVQSR
ILEEVGARVL VVDASHIDRG RALVASATTL LTLVTETYPA VPLDTGRTPR LPDLPPYDGD
ARALVTFTSG STGQPKSLSQ SYRTWNATVR GFSGRTDTHL PSRILAVTPV SHTVGFMVDS
VLAAGGSAVL HEGFDAGTVL SDVARHRITD TYLAVPHLYR LVEHEDLPRT DVSSLRRLIY
SGTPAAPRRI AQAVPCFRDA IVQLYGTTEA GGISSLTPLD HQEPELLPTV GRPFPWVQVR
MCDPDTGAEV ERGHVGEVWT YSTTVMDGYL ETGVPTHSTL RDGWLRTGDL GYWDQYGYLR
LVGRVGQVIK AGGQKVYPTA VESALQEHPD VRHAVVFGVH DRDRIEHVHA AVVLAPGSSV
TDEELSRHVA ATLDSAHAPA HFSRWAEIPL TAYGKPDRAS LRSRAEREAL GGGTVQRGRA
L