Gene Francci3_2148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2148 
Symbol 
ID3905538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2515435 
End bp2516991 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content70% 
IMG OID637879483 
Producthypothetical protein 
Protein accessionYP_481249 
Protein GI86740849 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACCG CGGTCGCGGC CCCAGCCACC AAGCAGGCCA TCGGCCCAGG GGCGTCGGGC 
TGGGAGCCGG CCGCGCGGCC GGTCGGCCCG TGGGATACGG CGTCGTTGGA CGACGTGCTC
GCGGCTGTGC CGACCTTGTC GTGGTGGAAC GCGAAGCGGC AGCAGGCGCA GCGGTCGGTT
GCCGGTGTCG AGCTGCTGCT GGACTGGCTG GGCCGGGTGC CCGGCGGCTG GCAGGACCGG
TGGGAGCATG CCGAGGCGAT GCTCGGTCCC AGGTGGGCGA CCTGGTCAGC CGCCATTCCC
GAGCCCGACA GGCCCTGCGG CGAACGCCAT CGCGTGGTGC TGACCCGAGG GCTGGCCTGT
CTGCTGCGGA TGAGGCTCGT CCGGCCGAGC TACCCGTTCC TGACCACCTA CGGGCCGACG
ACGACGTTCG CGGTCGTCCG CGACCTGGTG AGCCCGGAGT TGTTCGCCTG CGCGGCCGAC
GCCGCGCGGG CGTGTGGCGG CCATTCCGAG AACGTGCTCA GGAAGGCGCT GAACGTCCTG
ACCGCGATCG TGATCCATAC CGGTGGCAGC CTGAACGAGG TGACCACAGG CGACCTCCTC
GCGTTCCAGT CAGCCACGGC GCCCAGCCGC GGCCGGACGC GTGACGGTTC CCACCTGGCC
TGGCAGATGA TGGTCGACCT GGGCGTCTTC CCACCGCACT CCACGCTGCA CGCCGCCATG
CGCACCGGCC CGCAGAGCAC CGCCGAACTC GTGGACCGCC ACGGCATCGC CGACCGGCAG
ATCCGCGAGT TGCTGATCCG CTACCTCGAC GAGCGGCGTC CCGCGCTCGA CTACAGCACC
TTCCGCATGC TCGTCGGCCG TCTCGCCGGG GCGTTCTGGG CCGACCTCGA ACGCCATCAC
CCCGGCATCG ACACCCTCAA TCTGCCACGC GAGGTCACGG ATCCCTGGAA GGAACGGCTG
CGTTTTCCCC AGAACGGCGG CAGCCGCCCG ACCCGGCAAC GACACATCGA CACCCTGATC
ACGGTCCGCG CCTTCTACCT CGACATCGCC GAATGGGCAC TGTCGGACCC TTCCTGGGCG
CCTTGGGCGT TCCCCAGCCC GGTGCGCAAG AGCGACACCG CGGGAGTCGT CAAACAGCGC
CGCGCCACAA CGGCCTCGAA CTCGGCGACT GCGGACGCCC CTACGGCACC CCCTGCATCC
ACGAACACGC CTGCGTCCGC TGCCCCATGC TCCGCGTCGA CCCCCGCCAA CGCACCCGCC
TCGAACAGAT CATCCGCAAC CTCGGCGAAC GCATCGAGGA AGCCAAAGCC AACGGCTGGC
AGGGCGAAGT CGAGGGCCTC AAGACCAGCC TCGAAGCCGC ACAGCGCAAG CTCGCCGGCC
TGGACCGCGC AGCCCGCAAC ACCTCCAAAC CAGCACCACT CGGAATGCCA CCAATCCCAC
CCAAAATTTC AAGATCAAAA TCCGGATAAG CGGGTGCCTG GGATATCCCG CGAGCTTGGT
GGGTGCAGCT GGCCGAGGAC GGGCGTCTGG TGGTGCCGCT GCGGATCCTC GGGCTGA
 
Protein sequence
MPTAVAAPAT KQAIGPGASG WEPAARPVGP WDTASLDDVL AAVPTLSWWN AKRQQAQRSV 
AGVELLLDWL GRVPGGWQDR WEHAEAMLGP RWATWSAAIP EPDRPCGERH RVVLTRGLAC
LLRMRLVRPS YPFLTTYGPT TTFAVVRDLV SPELFACAAD AARACGGHSE NVLRKALNVL
TAIVIHTGGS LNEVTTGDLL AFQSATAPSR GRTRDGSHLA WQMMVDLGVF PPHSTLHAAM
RTGPQSTAEL VDRHGIADRQ IRELLIRYLD ERRPALDYST FRMLVGRLAG AFWADLERHH
PGIDTLNLPR EVTDPWKERL RFPQNGGSRP TRQRHIDTLI TVRAFYLDIA EWALSDPSWA
PWAFPSPVRK SDTAGVVKQR RATTASNSAT ADAPTAPPAS TNTPASAAPC SASTPANAPA
SNRSSATSAN ASRKPKPTAG RAKSRASRPA SKPHSASSPA WTAQPATPPN QHHSECHQSH
PKFQDQNPDK RVPGISRELG GCSWPRTGVW WCRCGSSG